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Method for identifying and characterizing organisms. 

@ A method of characterizing an unknown organism 
species comprises determining the position of part or whole 
of evolutionarily conserved sequences in genetic material of 
said organism, relative to a known position of restriction 
endonuctease cleavage sites in said genetic material (other 
than by determining the chromatographic pattern of retriction 
endonuclease digested DNA from said unknown organism, 
which digested DNA has been hybridized or reassociated with 
ribosomal RNA information containing nucleic acid from or 
derived from a known probe organism), thereby to obtain an 
identifying genetic characterization of said unknown organ- 
ism, and comparing said characterization with information 
^ from at least two sets of identifying genetic characterizations 
^ derived from the same conserved sequences, each of said 
sets defining a known organism species. 

0 

n 

D 

SI * 

1. 
u 



Doydon Pnnfing Company Lid. 



01 20658 



METHOD FOR IDENTIFYING AND CHARACTERIZING ORGANISMS 

The present invention relates to a method for the 
rapid and accurate characterization and identification 
of organisms r including prokaryotic and eukaryotic 
organisms, such as bacteria, plants, and animals* 

5, The classification of living organisms has 

traditionally been done along more or less arbitrary 
and somewhat artificial lines. For example, the living 
world has been divided into two kingdoms: Plantae 
(plants) and Animal ia (animals). This classification 

10. works well for generally familiar organisms, but 

becomes difficult for such organisms as unicellular 
ones (e.g., green flagellates, bacteria, blue-green 
algae), since these differ in fundamental ways^ from the 
"plants" and ''animals" • 

15. It has been suggested to simply divide organisms ' 

with respect to the internal architecture of the 
cell. In this scheme, all cellular organisms are 
either prokaryotic or eukaryotic* Prokaryotes are less 
complex than eukaryotes, they lack internal compart- 
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mentalization by unit membrane systems r and lack a 
defined nucleus. Prokaryotic genetic information is 
carried in the cytoplasm on double-stranded/ circular 
DNA; no other DNA is present in cells (except for the 
^* possible presence of phage r bacterial viruses, and 
cirular DNA plasmids, capable of autonomous 
replication). Eukaryotes, on the other hand, have a 
multiplicity of unit membrane systems which serve to 
segregate many of the functional components into 
10* specialized and isolated regions. For example, genetic 
information (DNA) can be found in a well- 
compartmentalized nucleus and also in organelles: 
mitochondria and (in photosynthetic organisms) chloro- 
plasts. The replication, transcription, and transla- 
tion of the eukaryotic genome occurs at either two or 
three distinct sites within the cell: in the 
nucleocytoplasmic region, in the mitochondrion, and in 
the chloroplast. 

The differences between prokaryotes and 
eukaryotes, however, breaks down when a comparison of 
mitochondria and chloroplasts is carried out with 
prokaryotes: these organelles are today considered to 
have been derived -from free-living prokaryotes, which 
entered into an endosymbiotic relation with primitive 
eukaryotes, and eventually became closely integrated 
with the machinery of the host cell and incapable of 
independent existence (see e.g.. Fox, G.E. et 
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al. Science 209: 457-463 (1980), at 462; Stanier, R. Y, 
et al , "The Microbial World," Fourth Edition, Prentice- 
Hall, Inc., 1976, at p. 86). For example, it has been 
demonstrated that DNA from mouse L cell mitochondria 
carrying the ribosomal RNA gene region exhibits notable 
sequence homologies to Escherichia coli ribosomal RNA, 
thus providing strong support for the endosymbiotic 
model (Van Etten, R. A. et al . Cell , 22:157-170 
(1980)). It has also been shown that the nucleotide 
sequence of 23S ribosomal DNA -from 2ea mays chloroplast 
has 71% homology with 23S ribosomal DNA from E. coli 
(Edwards, and Kossel, H> , Nucleic Acids • Research , 
9:2853-2869 (1981)); other related work ( Bonen, L. and 
Gray, M. W., ibid , £:319-335 (1980)) also further 
supports the general concept. 

In this model the eukaryotic cell is a phylo- 
genetic "chimera" with organelle components that are 
clearly prokaryotic in nature. The "prokaryotic- 
eukaryotic" dichotomy then, also has drawbacks, even as 
a broad classification method. 

Where classification of organisms becomes more 
than a scientific exercise is in the identification of 
plants and animals for hybridization and breeding pur- 
poses, and in the accurate and reliable identification- 
of microorganisms which may infect so-called "higher" 
organisms or other media. For example, the plant- 
breeder, cattle breeder, or fish breeder may wish to 
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have a quick and reliable means of identifying dif- 
ferent species and strains of their subjects • The 
veterinarian^ physician, or horticulturist may wish to 
have an accurate identification of any infectious 
5. organisms (parasites, fungi, bacteria, etc.) and 

viruses present in examined plant or animal tissues* 
The correct identification of species of these organ- 
isms and viruses is of particular importance. 

The problem can best be illustrated by referring 
10. to the identification of bacteria. Names of bacterial 
species usually represent many strains, and a strain is 
considered to be a population derived from a single 
cell. Bacterial species are usually defined by 
describing the degree of homogeneity and diversity of 
15^ attributes in representative samples of strains of 

species. Precise definitions of bacterial species are 
difficult to express because subjective limits to 
strain diversity within species are required to define' 
species boundaries. ( Buchanan, R. E., International 
20. Bulletin of Bacteriological Nomenclature and Taxonomy, 
15:25-32 (1965)). The practical application of 
definitions of species to the identification of an 
unknown bacterial -strain requires the selection of 
relevant probes, such as substrates and conditions to 
25. detect phenotypic attributes, and radioactively-labeled 
DNA from the same species. Because of the diversity of 
bacterial species, a screening procedure is the primary 
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tool used in the classical, progressive method for 
identification of a strain. Results of the screening 
procedure are then used to predict which other 
laboratory methods and reagents are relevant for 
definitive identification of the strain. 
Identification is ultimately based on certain 
phenotypic and genotypic similarities between the 
unidentified strain and characterized species. The 
challenge is to precisely define the boundaries of 
species, preferably in terms of a standard probe which 
reveals species-specific information, so that 
definitions of species can be directly and ^equally 
applied to the identification of unknown strains. 

Sergey's Manual of Determinative Bacteriology 
(Buchanan, R. E. and Gibbons, N. E., Editors," 1974, fith 
Edition, The Williams & Wilkins Company, Baltimore) 
provides the most comprehensive treatment of bacterial 
classification particularly for nomenclature, type 
strains, pertinent literature, and the like. It is, 
however, only a starting point for the identification 
of any species since, inter alia ,, it is normally out of 
date, and is limited in space to describing species 
quite-'* briefly. (See for example, Brenner, D, j., 
"Manual of Clinical Microbiology 3rd Edition, 
American Society of Microbiology, Washington, D.C., 
1980, pages 1-6.) 
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The term "species", as applied to bacteria, has 
been defined as a distinct kind of organism/ having 
certain distinguishing features, and as a group of 
organisms which generally bear a close resemblance to 
5. one another in the more essential features of their 
organization. The problem with these definitions is 
-that they are subjective} Brenner, supra, at page 2. 
Species have also been defined solely on the basis of 
criteria such as host range, pathogenicity, ability or . 
10. inability to produce gas in the fermentation of a given 
sugar, and rapid or delayed fermentation of sugars. 

In the 1960 *s, numerical bacterial taxonomy (also 
called computer or phenetic taxonomy) became widely 
used. Numerical taxonomy is based on an examination of 
15. " as much of the organism's genetic potential as pos- 
sible. By classifying on the basis of a large number 
- of characteristics, it is possible to form groups of 
strains with a stated degree of similarity and consider 
them species. Tests which are valuable for the 
20. characterization of one species / however, may not be 

useful for the next, so this means to define species is* 
not directly and practically applicable to the iden- 
1. tification of unknown strains. Although this may be 
overcome in part by selecting attributes which seem to 
25. be species specific/ when these attributes are used to 
identify unknown strains, the species definition is 
applied indirectly. See for example Brenner, supra , at 
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pages 2-6. The general method, furthermore , suffers 
from several problems when it is used as the sole basis 
for defining a species, among them the number and 
nature of the tests to be used, whether the tests 
5, should be weighted and. how, what level of similarity 
should be chosen to reflect relatedness, whether the 
same level of similarities is applicable to all groups, 
etc. 

Hugh, R« H> and Giliardi, G> L« , "Manual of 

10. Clinical Microbiology 2nd Edition, American Society 
for Microbiology, Washington, D.C., 1974, pages 250- 
269, list minimal phenotypic characters as -a means to 
define bacterial species that makes use of fractions of 
genomes. By studying a. large, randomly selected sample 

15. of strains of a species, the attributes most highly 

conserved or common to a vast majority of the strains 
can be selected to define the species. The uS|e of min- 
imal characters is progressive and begins with a 
screening procedure to presumptively identify a strain, 

20. so that the appropriate additional media can be se- 
lected. Then the known conserved attributes of the 
species are studied with the expectation that the 
• strain will haye most of the minimal characters. Some 
of the minimal characters do not occur in all strains 

25. .of the species. A related concept is the comparative 
study of the type, the neo-type, or a recognized ref- 
erence strain of the species. . This control is neces- 
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sary because media and procedures may differ among 
laboratories, and it is the strain, not the procedure, 
that is the standard for the species. 

A molecular approach to bacterial classification 
5^ is to compare two genomes by DNA-DNA reassociation, A 
genetic definition of species includes the provision 
that strains of species are 70% or more related. With 
DNA-DNA reassociation a strain can be identified only 
if the radioactively labeled DNA probe and unknown DNA • 
lO. are from the same species* The practical application 
of this 70% species definition however is limited by 
selection of an appropriate probe. This may be over- 
come in part by selecting phenotj^pic attributes which 
seem to correlate with the reassociation group, but 
15. when these are used alone the DHA-DNA reassociation 
species definition is also applied indirectly. 
Brenner, supra, at page 3, states that 

the ideal means of identifying bacterial 
species would be a 'black box* which 
20. would separate genes, and instantly 

compare the nucleic acid sequences in a 
given strain with a standard pattern for 
every known spe'cies-soaething akin to 
mass spec tropho tome trie analysis. 
25. Brenner, however, concedes that although restriction 
endonuclease analysis can be done to determine common 
sequences in isolated genes, "we are not at all close to 
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having an appropriate black box, especially one suited 
for clinical laboratory use*" His words could be equally 
applied to any species of organism. 

This brief review of the prior art leads to the 
5. conclusion that there presently exists a need for a 
rapid, accurate, and reliable means for identifying 
unknown bacteria and other organisms/ and to quickly 
classify the same, especially to identify the organism of 
a disease, or of a desirable biochemical reaction. The 
10. method should be generally and readily useful in clinical 
laboratories, should not be dependent on the number of 
tests done, on the subject prejudices of the clinician, 
nor the fortuitous or unfortuitous trial and error 
methods of the past. Further, a need also exists for a 
15. method useful for identifying and distinguishing genera 
and species of any living organism, which can be readily 
and reliably used by veterinarians, plant-breeders, tox- 
icologists, animal breeders, entomologists and in other 
related areas, where such identification is necessary. 
20. 'It is therefore an object of the invention to pro- 

vide a method which'* may:* be found:. to be quick, reliable 
and accurate of objectively identitying organisms, especi- 
ally - but not 'limited .to - microorganisms. 

"Yet another object of the invention is to provide a 
25. method of identifying organisms such as bacteria which 
utilizes the organisms' genome. 

" . Another object of the invention is to provide a 
method of characterizing and identifying species and 
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genera of pathogenic organisms in the clinical laboratory, 
so as to provide the capability of characterizing and 
identifying the cause of any given animal or plant disease. 

Still another object of the invention is to provide 
5^ various products useful in the aforementioned methodologies. 

These and other objects of the invention, as will 
hereinafter become more readily apparent, have been attained 
by providing: 

A method of characterizing an unknovm organism species 
lO, which comprises deteimining the position of part or whole of 
evolutionarily conserved sequences in genetic material of 
said organism, relative to a known position of restriction 
endonuc lease cleavage sites in said genetic material (other 
than by determining the chromatographic pattern of restric- 
IS.tion endonuclease digested DNA from said unknown organism, 
which digested DNA has been hybridized or reassociated with 
ribosomal RNA information containing nucleic acid from or 
derived from a known probe organism) , thereby to obtain an 
identifying genetic characterization of said xinknown 
20. organism, and comparing said characterization with informa- 
tion from at least two sets of identifying genetic charact- 
erizations derived from the same conserved sequences, each 
of said sets defining a known organism species. 

Still another object of the invention has been . 
2 5. attained by providing: 

A method of diagnosing a pathogenic organism infection 
in a sample which comprises identifying the organism in said 
sample by the aforementioned method. 
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For a better understanding of the invention, and to 
show how it may be put into effect, reference will now be 
made, by way of example, to the accompanying drawings in which 
FIGURE 1 shows the EcoR I restriction endonuclease 
5- digest of DNA isolated from strains of Pseudomonas 
aeruginosa, using cDNA to 16S and 23S ribosjomal RNA 
(rRNA) of E, coli as the probe. 

FIGURE 2 shows the Pst I restriction endonuclease 
digest of DNA isolated from strains of P> aeruginosa y 
10. using cDNA to 16S and 23S rRl^A of E. coli as the probe. 

FIGURE 3 shows the EcoR I restriction endonuclease 
digest of DNA isolated from species of glucose- 
nonf ermenting, gram-negative rods, using cDNA to 16S and 
23S rRNA of E. coli as the probe. 
15. FIGURE 4 shows the Pst I restriction endonuclease 

digest of DNA isolated from species of glucose- 
nonf ermenting, gram-negative rods using cDNA to 16S and 
23S rRl^A of E. coli as the probe. 

FIGURE 5 shows the EcoR I restriction endonuclease 
20- digest of DNA isolated from various Bacillus subtilis 

strains, using cDNA to 16S and 23S rRNA of E. coli as the 
probe . 

FIGURE 6 shows the Pst I data for the same strains 
as in FIGURE 5, with the same probe. " 
25* FIGURE 7 shows the Bgl II data for the same strains 

as in FIGURES 5 and 6,- with the same prober 

FIGURE 8 shows the Sac I data for the same strains 
as in FIGURES 5-7, with the same probe. 
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FIGURE 9 Shows the EcoR I restriction endonuclease 
digest of DIJA isolated from B, subtilis and B, polymyxa , 
using cDNA to 16S and 23S rRNA from E* coli as the probe* 

FIGURE 10 shows the Pst I data for the same strains 
5* as in FIGURE 9 with the same probe. 

FIGURE 11 shows the Bgl II and Sac I data for the 
same strains as in FIGURES 9 and 10, with the same probe, 

FIGURE 12 shows the detection of Streptococcus 
pneumoniae in EcoR I digested DNA from infected mouse 
10. tissues using cDNA from 16S and 23S rRNA from E, coli as 
the probe* 

FIGURE 13 shows the identification of a mouse spe- 
cies by comparing Pst I digests of DNA isolated from 
mammalian tissues r using cDNA to 18S and 28S rRl^A from 
15* cytoplasmic ribosoraes of Mus musculus domesticus (mouse). 

FIGURE 14 shows the EcoR I digested DNA from mouse 
and cat tissues hybridized with Mus musculus domesticus 
28S rRim cDNA probe. 

FIGURE 15 shows Sac I digested DNA from mammalian 
20. tissues hybridized with Mus musculus domesticus 18S and 
28S rRHA cDNA probe. 

FIGURE 16 shows EcoR I digested DNA from mammalian 
tissues and cell cultures hybridized with Mus musculaas 
domesticus 18S and 28S rRNA cDNA probe. 
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This invention is based on the inventor's reali- 
zation that, if species are discrete clusters of strains 
related to a common speciation events there should be, 
despite divergence, a likeness shared by strains that 
5. objectively defines the species boundary; strains of 

species should contain structural information which is a 
clue to their common origin. The greatest amount of an 
organism's past history survives in semantides, DNA and 
RNA, ( Zuckerkandle, E. and Pauling, L., Journal of 
10. Theoretical Biology , 8_:357-366 (1965)). 

In European Patent Application (EP-A-) • No. 0076123 
(herein incorporated by reference) the inventor 
described a system for the definition of species and 
characterization of organisms which makes use of 
15. information contained in ribosomal RNA genes (rRNA). 
Ribosomal RNA has a structural and functional role in 
protein synthesis ( Schaup, Journal of Theoretical 
Biology , 70:215-224 (1978)), and the general conclusion 
from rRNA-DNA hybridization studies, is that the base 
20. sequences of ribosomal RNA genes are less likely to 

change, or are more conserved during evolution, than are 
the nrajority of other genes (Moore, R. L. , Current Topics 
In Microbiology and Immunobiology , Vol. 64;105-128 
(1974X, Springer-Verlag, New York). For example, the 
25. primary structure of 168 rRNA from a number of bacterial 
species has been inferred from oligonucleotide analysis 
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( FoXf G> E. et al. International Journal of Systematic 
Bacteriology r 27 : 44-57 (1977)), There are negligible 
differences in the 16S oligomer catalogs of several 
strains of E. coli (Uchida, 1. et al ^ Journal of 
5. Molecular Evclutionr _3^:63-77 (1974)); the substantial 
differences among species, however, can be used for a 
scheme of bacterial phylogeny ( Fox, G>E>r Science, 
209 :457-463 (1980) )• Different strains of a bacterial 
species are not necessarily identical? restriction enzyme 
10. maps show that different EcoR I sites occur in rRNA genes 
in two strains of E> coli ( Boros, I>A. et al. Nucleic 
Acids Research 6:1817-1830 (1979)). Bacteria appear to 
share conserved rRNA gene sequences and the other 
sequences are variable (Fox , 1977, supra ) . 
15. The present inventor had thus discovered that 

restriction endonuclease digests of DNA have sets of 
fragments containing conserved sequences that are similar 
in strains of a species of organism (e»g*, bacteria), but 
different in strains of other species of the organism; 
20. i.e., despite strain variation, enzyme specific sets of 
restriction fragments with high frequencies of occur- 
rence, minimal genotypic characters, define the 
species. This is 'the essence of the invention described 
in EP-A-0076123. and* also that of the invention 
25. described herein. 

.The present invention constitutes an extension of 
the concepts developed in EP-A-0076123, in that 
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it has further been discovered that there exist sequences, 
in addition to those of rKNA, which are highly conserved 
through evolution and which may be as useful as rRNA 
sequences in the identification system. In other words, 

5. the present invention provides means for carrying out the 
identification and characterization techniques of EP-A- 
0076123, using any probe which is" conserved, other than 
rRNA. The present invention also provides additional 
examples of methods which may be used in the identification 

10. process. The present inventor has also discovered that 
the method is general, in that it is- applicable to both 
eukaryotic and prokaryotic DNA, using a conserved nucleic 
acid probe from any organism, prokaryotic or eukaryotic, 
of the same or different (classic) taxonomic classification 

15. than the organism being identified. 

The invention offers an objective method of defining 
organisms based on conserved sequences of DNA or other 
genetic material in relation to known positions such as 
restriction endonuclease sites. The detection of re- 

20. striction fragments containing a conserved sequence may 

be carried out by hybridizing or reassociating DNA segments 
with nucleic acid containing conserved sequence information 
from a probe organism. 

By the "organism" which can- be characterized (which 

25. term is meant to include "identified") by the process of 

the invention, it is meant to include virtually any organism 
which, by definition, contains DNA or RNA in its genome. 
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In this respect it is useful to refer to a classical tax- 
onomic scheme as a point of reference. 

All organisms belonging to the Kingdoms Monera^ 
Plantae and Animalia are included. For example / among 
5. those of the Kingdom Monera can be mentioned the 

Schizomycetes (Bacteria) of the classes myxobacteria/ 
spirochetes, eubacteria, rickettsiae,* and the cyanopytha . 
(blue green algae). Among those of the Kingdom Plantae 
can be mentioned the Division Euglenophyta (Euglenoids)^ 
10. Division Chlorophyta (green-algae) classes chlorophyceae 
and charophyceae r Division Chrysophyta, classes xantho- 
phyceae; chrysophyseae , bacillariophyceae ; Division 
Pyrrophyta (Dinof lagellates) y Division Phaeophyta (Brown 
algae) Division Bhodophta (Red algae); Division 
15. Myxomycophyta (slime molds )r classes myxomycetesj 

acrasiae, plasmodiophoreae, labyrinthuleae ? Division 
Eumycophyta (true fungi), classes phycomycetes, ascomy- 
cetes, and basidomycetes ; Division Bryophta, classes 
hepaticae, anthocerotae, and musci; Division Tracheophyta 
20. (Vascular plants), subdivisions psilopsida, lycopsyda, 
sphenopsida, pteropsida, spermopsida classes cycadae, 
ginkgoae, coniferae, gneteae and angiospermae subclasses 
dicotyledoneae, monocotyloedoneae. Among those of the 
Kingdom Animalia can be mentioned the Subkingdom Proto- 
25. zoa. Phylum Protozoa (Acellular animals) subphylum 

plasmodroraa, classes flagellata, sarcodina and sporozoa; 
subphylum ciliophora, class ciliata; the Subkingdom 
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Parazoa, Phylum porifera (Sponges), class calcarea, 
hexactinellida, and desmospongiae; the Subkingdom 
Mesozoa, Phylum mesozoa; the Subkingdom Metazoa, Section 
Radiata, Phylum coelenterata, classes hydrozoa/ scypho- 
5, - zoa, anthozoa. Phylum ctenophora, classes tentaculata and 

- nuda; Section Protostoraia Phylum platyhelmintes (flat- 
worms) classes tubellana, trematoda, and cestoda; Phylum • 
nemertina; Phylum acanthocephala ; Phylum aschelraintles, 
classes rotifera, gastrotricha, kinorhyncha/ priapulida, 

10. nematode and nematomorpha; Phylum entoprocta; Phylum 
ectoprocta, classes gymnolaemata and phylactolaemata; 
Phylum phoronida; Phylum braciopoda, classes inarticulata 
and articulataj Phylum mollusca (molluscs) classes 

15. amphineura/ monoplacophora , gastropoda, scaphopoda, 

• pelecypoda, and cephalopoda; Phylum sipunculida; Phylum 

- echiurida; Phylum annelida, classes polychaeta, 
oligochaeta and hirudinea; Phylum onychophora; Phylum 
tardigrada; Phylum pentastoraida; Phylum arthropoda, 

20. subphylum trylobita, subphylum chelicerata classes 

xiphosura, arachmida, pycnogomida, subphylum mandibulata 
classes Crustacea/ chilopoda, diplopoda, pauropoda, 
symphyla, insecta of the orders collembolar protura, 
diplura, thysanura, ephemerida, odonata, orthoptera, 

25. dermaptera, embiania, plecoptera, zoraptera, corrodent ia, 
mallophaga, anoplura, thysasnoptera, hemiptera, neurop- . 
tera, coleoptera, hymenoptera, mecoptera, siphonaptera, 
diptera, trichoptera and lepidoptera; those of the 
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Section Deuterostomia/ phylum chaetognatha, phylum 
echinodermata, classes crinoidea/ asterordea, ophiuroi- 
dea, echinoidea, and holoturoidea, phylum pogonophora; 
phylum hemichordatar classes enteropneusta/ and 
5. pterobranchia; phylum chordata, subphyluan urochordata, 
classes ascidiaciae, thaliaceaer larvacea; subphylum 
cephalochordatar subphylum vertebrate, classes agnatha, - 
chondrichthyes, osteichthyes {subclass saccopteiygii 
orders crossopterygii and dipnoi)/ amphibia, repitilia,. 
10. aves and mammalia/ subclass prototheria, subclass theria, 
orders marsupial ia, insect ivora, dermoptera, chiroptera, 
primates/ edentata, pholidota, lagomorpha/ rodentia, 
cetaceae, carnivora/ tubulidentata, probosicdea, 
hyracoidea/ sirenia, perissodactyla and artiodactyla. 
15, It is understood that beyond the order, the organ- 

isms are still classified according to their families, 
tribes, genus and species, and even subspecies, infra- 
subspecific taxons, and strains or individuals. In 
addition, cell cultures (plant or animal), as well as 
20. viruses can also be identified. These classifications 
are used in this application for illustrative purposes • 
only, and are not to be taken as exclusive. The organism 
is either known or unknown, most commonly the organism ias 
an unknown being identified. 
25. Functionally, for the purposes of this invention, it 

is convenient to divide all organisms into the eukaryotes 
and the prokaryotes. When identifying a prokaryotic 
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organism^ the DNA to be analyzed is that present in the 
cells or in the non-compartmentalized chromosomes. When 
identifying a eukaryotic organism one may either use the 
nuclear DNA or the organelle DNA (mitochondrial DNA or 
5. chloroplast DNA). 

Briefly r High molecular weight DNA and/or small 
circular DNAs are isolated from the organism to be iden- 
tified in order to analyze the conserved sequences (and 
possibly sequences that could be used to create a taxon 
10, below the rank of species or inf rasubspecif ic 

subdivisions.) The DNA's are extracted by methods which 
are well-known to the art. 

The DNA's are analyzed to ascertain both 1) the 
presence and position of the conserved sequences and 2) 
15. their position: relative to endonuclease restriction 

sites ♦ The easiest way to analyze for the presence of 
the conserved _sequences is to utilize a polynucleotide 
probe capable of hybridizing with the conseirved DNA 
sequence • However/ direct sequence information as 
20. obtained by chemical sequejice determination and analysis 
thereof could also be utilized. In EP-A-0076123 the 
probe utilized was an-xRNA . inf ormation containing-probe; 
lii this case any other" pfcobe having conserved sequences 
could be used.' In an analogous manner, the easiest way 
25. of finding a given set of endonuclease restriction sites 
is to cleave the DNA with the appropriate restriction 
enzymes. (This, indeed, is the manner taught and 
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practised in EP-A-0076123. However, alternative methods, 
such as sequence information coupled with known restric- 
tion site sequences, or cleavage and partial sequencing 
could also be used- 
5. Most conuuonly DNA's are going to be cut at specific 

c 

sites into fragments by restriction endonucleases . The 
fragments are separated according to size by a 
chromatographic system. In EP-A-0076123 gel 

chromatography was used as an example of a useful 

10. chromatographic system. However, other systems can also 
be used, such as high pressure liquid chromatography r 
capillary zone electrophoresis r or other separation 
techniques. In using gel chromatography, the fragments 
are separated, the gels are stained, as is otherwise 

15^ well-known in the art, and standardized as to the 

fragment sizes using standards curves constructed with 
fragments of known sizes. The separated fragments may 
then be transferred to cellulose nitrite paper by the 
Southern blot technique (Southern, E. M, , Journal of 

2Q^ Molecular Biology , 38 s503-517 (1975), herein incorporated 
by reference), and covalently bound thereto by heating. 
The fragments containing the conserved sequences are then 
located by their capacity to hybridize with a nucleic 
acid probe containing conserved sequence information* 

25. Alternatively, hybridization can occur after digestion 
but before separation; or restriction cleavage can occur 
after hybridization, followed by separation of the 
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fragments . 

The nucleic acid probe can either be non- 
radioactively labeled or, preferably, radioactively 
labeled. When radioactively labeled, the probe can be 
5. RNA, or preferably DNA which is complementary to RNA 
(cDNA), either synthesized by reverse transcription or 
contained on a cloned -fragment, which can be labeled, for 
example, by nick translation. Also, synthetic 
oligodeoxyribonucleotides nay be prepared with labeled 
10. nucleotides. 

The well-defined probe is derived from an 
arbitrarily chosen organism, see infra, or may be a 
consensus sequence. Once hybridization has occurred, the 
hybridized fragments are detected by selectively 
15.- detecting double stranded nucleic acid (non-radiolabeled 
probe), or visualized by , e.g., autoradiography (radio- 
labeled probe). The -size of each fragment which has been 
hybridized is relative to the restriction sites and is* 
determined from the d.istance traveled using standard 
20. curves, as described previously. The amount of 

hybridization, the pattern of hybridization, and the 
sizes of the hybridized fragments, which are relative to 
restriction sites, can be used individually or in 
conjunction to identify the organism. 
25. The genetic characterization that emerges from this 

technique can be readily compared to equivalent char- 
acterizations derived from at least two and up to a 
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multiplicity of knownr standard organisms, genera or 
species. After a preliminary broad classification has 
already been carried out (using, for example, classical 
taxonomy )# the comparison can be either by visual inspec- 

5. tion and matching of appropriate chromatographic 
patterns, (as in EP-A-0076123) by comparison of 
hybridized restriction fragment sizes, by band intensity 
(amount of hybridization) or by any combination there- 
of. Ideally, the comparison is carried out with a one- 

10. dimensional computer-based pattern recognition system, 
such as those used in point-of-sale transactions. 

The present inventor discovered that when using the 
aforementioned method, the genetic characterizations for 
organisms of the same, species are substantially similar, 

15. with minor variations allowed for intraspecies differ- 
ences due to strain variations, whereas differences 
between species, and differences between genera (and 
higher classifications) are maximal. 

The use of enzyme-specific fragment variations among 

20. strains of a species permits the typing of strains for 
various purposes; e.g. in the case of bacteria, for 
epidemiological purposes. In fact, restriction enzymes 
can be chosen for -thf ir ability to distinguish strains 
within species. 

25. The "probe organism" used in the present invention, 

and from which is obtained the nucleic acid probe, can 
also be any of the aforementioned organisms; it can be 
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either eukaryotic or prokaryotic. The only limitation is 
given by the fact that the conserved sequence-containing 
probe should hybridize maximally with the unknown 
organism's DNA. 

There are four types of -conserved sequence 
information-containing probes: 1) prokaryotic- probes 
(especially bacterial-derived) r 2) eukaryotic mito- 
chondrial probes, 3) eukaryotic chToroplast probes , and 
4) eukaryotic non-organelle probes. There are also four 
• sources of DNA (to be endonuclease digested): 1) 
prokaryotic cellular DNA, 2) eukaryotic mitochondrial " 
DNA, 3) eukaryotic chloroplast DNA, and 4) eukaryotic 
nuclear DNA. The following hybridization table can thus 
be constructed (Table 1). 
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Conserved Gene Sequence Probe 



Unknown organism 

DNA Prokaryotic Eukaryotic 



Prokaryotic 
Eu . t ^ ^ Mitochondria 
Eu. Chloroplast 
Eu. Nuclear 



Mito- Chloro Non- 
chondrial plast organelle 
+ + + - 

+ + -f - 

+ + + - 



(1) Eu = Eukaryotic 

(2) « refers to generally less effective hybridizationr 
see Example. . 4, infra. 
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The Table shows which probes can generally be 
maximally hybridized with which unknown DNA. For 
example/ one can identify a eukaryotic organism by 
extracting species specific mitochondrial or chloroplast 
5. DNAr endonuclease-digesting it and hybridizing the digest 
with either a prokaryotic prober or with an organelle 
derived eukaryotic probe* In the same manner, one can 
identify a prokaryotic organism by extracting species- 
specific cellular DNA, endonuclease-digesting it, and 
10. hybridizing the digest with either a prokaryotic probe, 
or an organelle-derived eukaryotic RIQA probe. Also, one 
can identify a eukaryotic organism by extracting and di- 
gesting species-specific nuclear DNA, and hybridizing it 
with a non-organelle derived eukaryotic probe* 
15. Eukaryotes could be defined by one or any combination of 
the nuclear, mitochondria, or in some cases chloroplast 
systems. These cross-hybridizations are based on the 
fact that nucleic acid derived from eukaryotic organelles 
^ has extensive homology with evolutionarily conserved 
20. sequences from prokaryotic nucleic acid, but that the 

same homologies are generally not present to such extent 
between nuclear-derived eukaryotic DNA and prokaryotic 
DNA. 

The choice of any. pair of DNA to be digested and 
25. accompanying probe is arbitrary, and will depend on the 
organism being identified, i.e. it will depend on the 
question asked. For example, in detecting a prokaryotic 
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species (e.g. bacteria) present in or together with a 
eukaryotic cell (e.g. animal or plant) for purposes of 
detecting and identifying an infecting agents one may 
choose a prokaryotic probe and work under conditions 

5. where organelle-derived DNA is not extracted or only 
minimally extracted. In this manner one assures that 
interference between organelle-derived DNA and prokaryo- 
tic DNA is minimal. In identifying a eukaryotic species 
(which is not infected with a prokaryote) with a 

lO. prokaryotic probe, it is best to maximize the concentra- 
tion of organelle-derived DNA, as for example by 
separating organelles from nuclei, and then extracting 
only organelle DNA. If one wishes to identify a 
eukaryotic organism which has been infected with a 

15. prokaryotic organism, it is best to use a non-organelle , 
non-prokaryotic derived probe since it will generally not 
hybridize well with the DNA from the prokaryote. 

It is preferred to use a pair (DNA and probe) from 
the same kingdom, or same subkingdom, or same section, or 

20« same phylum, or same subphylum, or same class, or same 
subclass, or same order, or same family or same tribe or 
same genus. It is particularly preferred to use 
prokaryotic probe (e.g. bacterial probe) to hybridize 
with prokaryotic DNA. In this manner one could detect, 

25. quantify, and identify genera, species, and strains of 
prokaryotic organisms. One of the most preferred 
prokaryotic probes is derived from bacteria, and further. 
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because of the ease and availability, from E. coli. The 
probe from E> coli can be used to identify any organism, 
especially any prokaryotic organism, most preferably a 
strain of any bacterial species • Another particularly 

5. preferr^id embodiment is to use eukaryotic probe derived 
from a given class to identify eukaryotic organisms of 
the same class (e.g. mammalian probe to identify mammalian 
organism) . Most preferred is to use probe and DNA from 
the same subclass and/or order and/or family of organisms, 

10* (e.g. if identifying a species of mouse, it is preferred 
to use mouse-derived probe) . 

The roost sensitive and useful pair systems are those 
where there is less evolutionary distance or diversity 
between the source of the probe and the unknown DNA. 

15. The phrase "evolutionarily conserved genetic material 

sequence" is used in this invention to denote genetic 
material, e.g DNA, sequences that show homology between 
at least two different species of plants, animals or micro- 
organisms. The homology between two conserved sequences 

20. is to be such that, if one of such DNA molecules were to 
be detectably labelled, sufficient hybridization or an- 
nealing would occur if both single stranded DNA molecules 
or fragments thereof were placed together under hybridiz- 
ation conditions, thereby to produce a duplex of sufficient 
25. stability to be detectable by standard methodology Ci.e, 
radiolabelling, enzyme labelling, and the like). 

In EP-A-0076123 the evolutionarily conserved 
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sequence exemplified was that of ribosomal RNA 
genes. This is still a highly preferred gene sequence. 
Howeverf it has been discovered that other gene sequences 
exist which are sufficiently conserved across the 
5* evolutionary span to be useful. 

Examples of such additional sequences are those of 
genes "or portions thereof coding for transfer RNA^ 
or proteins denoted as belonging to the same Superf amily/ 
or same Family/ preferably same Subfamily or even same 
10. entry in Dayhoff 's "Atlas of Protein Sequence and 

Structure", Volume 5, Supplement 3/ 1978/ NBR, 1979, 
pages 9-24/ herein incorporated by reference. A Family 
of proteins is one wherein any two proteins differ from 
each other by less than 50% amino acid residues in their 
15. sequence. A Subfamily of proteins is one wherein any two 
proteins differ from each other by less than 20% amino 
acid "residues in their sequence* An "Entry" is one 
wherein any two proteins differ from each other by less 
than r5% amino acid residues : i their sequence. 
20. Specific examples of -ge:.e sequences or appropriate 

portions thereof which can b: used are: cytochrome C 
related genes, cytochrome C3 related genes, cytochrome cj^ 
related, cytochrome bg relatfd, ferrodoxin related, 
rebredoxin related, flavodo n related., alcohol 
25. dehydrogenase related, lacti :e dehydrogenase related, 
peroxidase related, adenyla.2 kinase related, 
phospholipase A2 related, t: 'ptophan operon related. 
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carboxypeptidase related, subtilisin related, 
penicillinase related, protease inhibitor related, 
somatotropin related, corticotropin related, lipotropin 
related, glucagon related, snake venom toxin related, 
5. plant toxin related, antibacterial toxin related, 

imihunoglogulin related gene, ribosomal other than rRNA- 
related genes, heme carrier genes, chromosomal protein 
genes, fibrous protein genes, and the like. 

The conservation of some of these additional DNA 
10. sequences is not as widespread throughout the animal, 

plant or microbiological domains as is that of the rRNA 
genes. (Thus the' still preferred use of rRNA). This, 
however, does not constitute a serious impediment to 
tUeir use since it may be possible to utilize such 
15. additional sequences to identify or characterize 

organisms within more limited ranges or subdomains. For 
example it may be possible to utilize trp D gene 
sequences from bacteria to generate a trp D bacterial * 
probe and then use this probe to test within the 
20. bacterial domain. In fact, it may be possible to use a 
trp D probe within an even narrower domain (e.g., test 
for the presence of Enterobacteriaceae , or of Bacillus , 
etc.) with a trp D probe from the same order, family or 
genus. Thus, while the range of applicability of some of 
25. the additional probe sequences may not be as broad as 
that of rRNA probes, their applicability will 
nevertheless be quite effective within narrower domains. 
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A probe containing the conserved DNA sequence 
information is prepared in the same manner as the 
preparation of rRl^A information containing probe exem- 
plified in EP-A-0076123. The probe can thus be 
5. RNA, DNA or cDNA, and the like. 

The individual steps involved in the technique will 
be described hereinafter broadly with reference to both 
eukaryotic and prokaryotic cells when applicable, or 
specifically for each type of cell if some difference in 
10. technique exists. 

The first step is extraction of the DNA from the 
unknown organisms. Nuclear DNA from eukaryotic cells can 
be selectively extracted by standard methodology wellr 
known to the art (see for example, Drohan, W. et al, 
15. Biochem. Biophys. Acta , 521 (1978), 1-15, herein 

incorporated by reference). Because organelle DNA is 
small and circular, spooling techniques serve to separate 
the non-circular nuclear DNA from the circular, 
organelle-derived DNA. As a corollary, the non-spooled 
20. material contains the organelle-derived DNA which can 
separately be isolated by density gradient centrifu- 
gation. Alternatively, mitochondria (or chloroplasts) 
are separated from a mixture of disrupted cells; the 
purified mitochondrial (or chloroplast) fraction is used 
25. for the preparation of organelle-derived DNA while the 
purified nuclear fraction is used to prepare nuclear 
DNA. (See for example Bonen L. and Gray, M. W. , Nucleic 
Acids Research, 8:319-335 (1980)). 
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Prokaryotic DNA extraction is also well-known in the 
art. Thus, for example, an unknown bacterium present in 
any medium, such as an industrial fermentation suspen- 
sion, agar medium, plant or animal tissue or sample or 

5, the like, is treated under well-known conditions designed 
to extract high molecular weight DNA therefrom. For 
example, cells of the unknown organism can be suspended 
in extraction buffer^ lysozyme added thereto, and the 
suspension incubated* Cell disruption can be further 

10. accelerated by addition of detergents, and/or by increase 
in temperature. Protease digestion followed by 
chloroform/phenol extraction and ethanol precipitation 
can be used to finalize the extraction of DNA. An 
alternative method of extraction, which is much faster 

15. than phenol/chloroform extraction, is rapid isolation of 
DNA using ethanol precipitation. This method is pre- 
ferably used to isolate DNA directly from colonies or 
small, liquid cultures. The method is described in 
Davis, R. w. et alt "A Manual for Genetic Engineering, 

20. Advanced Bacterial Genetics, •* (hereinafter "Davis"), Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, 
1980, pp. 120-121, herein incorporated by reference. 

The DNA (prokaryotic or eukaryotic (nuclear or non- 
nuclear)) is dissolved in physiological buffer for the 

25. next step. There are a variety of possible steps to be 
followed after isolation of the desired DNA. One of 
these steps is endonuclease digestion. 
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Digestion of extracted DNA is carried out with 
restriction endonuclease enzymes. Any restriction endo- 
nuclease enzyme can be used* Preferably it is not from 
the same organism species as that being identified, since 
5. otherwise, the DNA may remain intact. (This may, in any 
events identify the organism, since the enzymes are not 
expected to cut DNA from the species of their origin.) 
Since the organism species being characterized may be 
unknown, obtaining a suitable digest of fragments may 
10. entail a minimum amount of trial and error, which can 
routinely be carried out by those skilled in the art 
without undue experimentation. Examples of possible 
restriction endonuclease enzymes are Bgl I, BamH I, EcoR 
I, Pst I, Hind III, Bal I, Hga 1, Sal I, Xba I, Sac I, 
15. Sst I, Bel I, Xho I, Kpn 1, Pvu II, Sau Ilia, or the 
like. See also Davis, supra , at pp. 228-230, herein 
incorporated by reference. A mixture of one or more 
endonucleases can also be used for the digestion. 
Normally, DNA and endonuclease are incubated together in 
20. an appropriate buffer for an appropriate period of time 

(ranging from 1 to 48 hours, at temperatures ranging from 
25*»C-65**C, preferably 37*»C). 

The resulting identifying genetic characterization 
will depend on the type or types of endonucleases 
25. utilized, and will be endonuclease-specif ic. It is 

therefore necessary to note which enzyme or enzymes have 
been used for the digestion since comparative character- 
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izations used in a catalog should have been prepared 
using the same enzyme or enzymes. 

An alternative step is to define endonuclease sites 
on the desired DNA molecules without digestion thereof, 
for example, by sequencing and reference to a restriction 
site library. Obviously, digestion is the more efficient 
method of noting such sites, but the method need not be 
limited thereto. The essence of the invention is the 
discovery that the position of conserved sequences along 
DNA, relative to the position of endonuclease restriction 
sites, forms a set which is characteristic for each 
species. Thus, any technique which yields the desired 
information (position of the genes vis a vis position of 
the sites) will be useful in the invention. 

Also, the position of the conserved sequences along 
the DNA molecule is best noted by use of a hybridization 
probe. This probe is allowed to anneal to restriction 
fragments of the unknown's DNA. However, any other 
method that would allow the determination of the 
conserved DNA sequences, such as sequencing, would also 
be useful. When using the hybridization probe it is 
preferred to first digest and separate DNA fragments 
according to size, and then to hybridize the separated 
fragments. However, it is possible to first digest and 
anneal DNA with a molar excess of probe and /or sequences 
complementary to probe and then separate the mixture. 
For example, unknown DNA can be digested with a 
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restriction endonucleaser denatured r and hybridized in 
liquid with a molar excess of one or more small 
detectably labeled DNA fragments or synthetic 
oligodeoxyribonucleotides complementary to a portion or 

5. portions of the conserved sequence of interest. Since 
most restriction enzymes cut fairly infrequently in DNA, 
in most cases the double-stranded region or regions of 
the hybrid will be small relative to the size of the 
restriction fragment. The hybridization reaction is 

lO. conducted under conditions where only the 

oligodeoxyribonucleotides hybridize. The unreacted, 
single-stranded DNA fragments, and the DNA fragments 
containing the hybridized oligodeoxyribonucleotides are 
separated by conventional chromatographic techniques. 

15. The labeled DNA fragments will appear in predictibly 

sized fractions. It is also possible to first anneal DNA 
with a molar excess of probe , then digest , and then 
separate the mixture. When the solution is incubated for 
a short time period or to a low C^t, the restriction 

20. sites will be limited to the hybridized, double-stranded 
regions. IThen the solution is incubated for a long time 
period or to a high C^t, the unknown DNA will anneal, 
thus creating a labeled duplex susceptible to restriction 
endonuclease cleavage. Dnreassociated single-stranded 

25. tails may be removed with a nuclease such as SI. 

Unpaired bases may be filled in using DNA polymerase I or 
T4 polymerase. 
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Alternatively/ one could find subsequences within 
the conserved sequence information (e.g. 20-, 30-/ or 50- 
mers)/ which are more highly conserved than the remainder 
of the conserved region chosen as the probe. Those 
5. "shorter" sequences can be made synthetically or 

• enzymatically/ if desired/ and may incorporate labeled 
nucleotides • Single-stranded/ predigested DNA from the 
unknown is allowed to incubate with these shorter,, highly 
conserved fragments and allowed to hybridize thereto. 
10. Separation would then be carried out on the digest 
mixture containing fragments partly annealed to the 
shorter labeled probes. (Thus, separation would occur 
after hybridization.) Separation could be by liquid 
chromatography/ since the digest mixture would for all 
15. practical purposes behave as a mixture of essentially 
single-stranded fragments. 

As indicated/ a preferred method is to first digest/ 
then separate, and then hybridize. Thus, after 
endonuclease digestion/ the. incubation mixture/ which 
20. contains fragments of varying sizes / is preferably 

separated thereinto by an appropriate chromatographic 
method. Any method which is capable of separating nu- 
cleic acid digests according to size, and which allows 
the eventual hybridization with the nucleic acid probe 
25. when hybridization is the last step, can be used. For 
example/ gel electrophoresis, high pressure liquid 
chromatography or capillary zone electrophoresis can be 



-36- 



0120658 



used. (Gorgenson/ J.W. , J, of HRC and CC, Aj 230-231 
(1981)). Presently preferred is gel electrophoresis, 
most preferred is agarose gel electrophoresis. In this 
system, the DNA digests are normally elect rophoresed in 

5- an appropriate buffer, the gels are normally immersed in 
an ethidium bromide solution, and placed on a DV-light 
box to visualize standard marker fragments which may have 
been added* Detectably labeled standard marker fragments 
may be used as well. 

10. After separation and visualization, the DNA frag- 

ments are transferred onto nitrocellulose filter paper or 
onto charge-modified nylon membranes by the method of 
Southern (Journal of Molecular Biology, 38:503-517 
(1975)). The transfer can be carried out after 
denaturation and neutralization steps, and is usually 
done for long periods of time (approximately 10-20 hours) 
or, alternatively by means of an electrically driven 
transfer from gel to paper. Instruments used to 
accelerate the transfer from gel to .paper are com- 
mercially available. The receiving nitrocellulose filter 
papers are then normally baked at high temperatures (60- 
80*C) for several hours, to bind the DNA to the filter. 

Alternatively, transfer can be^-avoided by using the 
recent method • of direct hybridization of Purrello, M. et 

25. a]^. Anal. Biochem. , 128: 393-397 {1983) . ' 

The probe utilized for the hybridization of the 
paper-bound DNA digest fragments is a defined nucleic 
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acid probe preferably from or derived from a given well- 
defined organism or the base sequence is known. 

Alternatively, the probe sequence may not have a 
natural conterpart; i.e., it may be a consensus sequence 

5, with a base at each position that is most commonly 
present at that residue in a number of equivalent 
sequences in different species. The consensus sequence- 
is then generally able to form a more stable hybrid than 
any one of the naturally occurring sequences* The probe 

10. niay be a synthetic oligodeoxyribonucleotide molecule made 
by covalently attaching individual nucleotides in a 
predetermined sequence. Synthetic molecules may be 
prepared, for example, by the triphosphate method of 
synthesis (Alvarado-Urbina et al . Science 214 ; 270-274 ; 

15. (1981) )• The probe molecules may be of any useful size, 
and more than one sequence may be in the probe 
solution. For example, several 20 base sequences might 
be used to detect several highly conserved regions in • 
rRNA genes. It may be detectably labelled or non-labeled, 

20. preferably detectably labeled. In such case, it is 
either detectably labeled RNA, but preferably nick- 
translated labeled DNA, cloned DNA, or detectably labeled 
DNA which is complementary to the RNA from the probe 
organism (cDNA), all of which contain highly conserved 

25. DNA sequence information. Synthetic 

oligodeoxyribonucleotides may be prepared with detectably 
labeled nucleotides, so the molecule is labeled by 
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incorporating labeled nucleotide residues. Depending on 
the choice of pair/ the probe, may. be from a prokaryote, 
or from a eukaryote (cytoplasm-derived, or organelle 
derived). Most preferably, the detectable label is a 

5. radioactive label such as radioactive phosphorus (e.g., 

^^P, or -'■^C) or a biotin/avidin-based system. The 
nucleic acid probe may also be labeled with metal 
atoms. For example, uridine and cytidine nucleotides can 
form covalent mercury -derivatives. Mercurated nucleoside 

T 0 

" triphosphates are good substrates for many nucleic acid 
polymerases, including reverse transcriptase (Dale et al. 
Proceedings of the National Acadeny of Sciences 70 : 2238- 
2242, 1973). Direct covalent mercuration of natural 
nucleic acids has been described. (Dale et al , 

15. Biochemistry 14: 2447-2457) . Reannealing properties of 
mercurated polymers resemble those of the ' corresponding 
nonmercurated polymers (Dale and terd. Biochemistry 
14:2458-2469) . Metal labelled probes can be detected, 
for example, by photo-acoustic spectroscopy, x-ray 
.20. spectroscopy, e.g., x-ray fluorescence, x-ray absorbance, 
or photon spectroscopy. 

The isolation and preparation of any desired 
conserved DMA sequence-containing probe is within the 
skill of the art. For example, the isolation of rRNA 

25. from eukaryotes or prokaryotes is well-known in the 

art. Thus, to prepare rRNA from eakaryotic cytoplasmic 
ribosomes, RNA can be extracted from whole cells or 
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ribosomes, separated by sucrose gradient centrif ugation, 
and the 18S and 28S fractions can be collected using 
known molecular weight markers. (See for example. Perry, 
R, P, and Kelly, D. E. , "Persistent Synthesis of 55 RNA 

5. Wlien Production of 28S and 18S Ribosomal Bl^h is Inhibited 
by Low Doses of Actinomycin D," J. Cell, Physiol I, 
72;235-'246 (1968), herein incorporated by reference). As 
a corollary, organelle-derived rRNA is isolated and 
purified from the organelle fractions in the same manner 

lO. (see e.g. Van Etten, R. A. et al . Cell , 22;157-170 

(1980), or Edwards, K. et al. Nucleic Acids Research/ 
9_:2853-2869 (1981)). 

If radioactively labeled probe is used, the same is 
isolated from the probe organism after growth or= cultiva- 

15 tion of the organism with nutrients or in culture media 
containing appropriately radioactive compounds. TThen the 
probe is complementary DNA (cDNA), the same is prepared 
by reverse transcribing isolated RNA from the probe 
organism, in the presence of radioactive nucleoside 

20. triphosphates (e.g., -^^P-nucleosides or -^H-nucleosides) . 

The labeled probe may also be a nick-translated DNA 
molecule, especially one obtained from organelle-derived 
whole circular DNA. In this embodiment, chloroplast or 
mitochondrial DNA is nick-translated in the presence of 

25. radiolabel, and a labeled DNA probe is thereby 

obtained. The chloroplast labeled probe will hybridize 
best with chloroplast DNA, and the mitochondrial labeled 
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probe will hybridize best with mitochondrial DNA. The 
chloroplast (or mitochondrial) nick-translated labeled 
probe will hybridize second best with mitochondrial (or 
chloroplast) DNA; it will also hybridize, albeit 
5. generally in less favorable fashion, with whole plant (or 
animal) DNA. The probe My also be obtained from 
eukaryotic nuclear DNA by nick-translation, although 
practical considerations would rule against this mode. A 
more useful approach in this embodiment is to cut out the 
10. highly conserved genes from the nuclear eukaryotic DNA 
(by restriction enzymes), separate the fragments, 
identify the gene sequences (as by hybridization), and 
isolate said gene sequences (as by electrophoresis). The 
isolated sequences may then be recombined into a plasmid 
15. or other vector, and after transformation of an 
appropriate host, cloned in 32p_containi -g media. 
Alternatively, the transformed host is crown, and the DNA 
is then isolated and labeled by nick-trmslation; or the 
DNA is isolated, the sequences are cut :ut and then 
20. labeled. The resulting ribosomal probe will hybridize in 
the same instances as cDNA (see infra). 

The preferred nucleic acid probe :s radioactively 
labeled DNA complementary to KNA f rom »he probe organ- 
ism. The RNA is usually messenger RN? coding for a. 
25. conserved gene and is substantially fa«.e of other RNA's 
such as transfer RNA (tRNA) or ribosonl RNA (rRNA) 
(unless rRHA is used). If rRNA were t: be used. 
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prokaryotic rRNA normally contains three subspecies: the 
so-called 5S, 16S and 23S fragments. The reverse trans- 
cription into cDNA can be carried out with a mixture of 
all three, or alternatively, with a mixture of 16S and 

5. 235 fragments* It is less preferred to carry out the 
reverse transciption with only one of the rRNA 
components, although under certain conditions this may be 
feasible. Eukaryotic rRNA normally contains two 
subspecies: 18S and 28S, and the reverse transcription 

10. into cDNA can be carried out with a mixture of 18S and 
28S fragments or with each. 

The pure RNA, substantially free of other types of 
RNA, is incubated with any reverse transcriptase capable 
of reverse transcribing it into cDNA, preferably with 

15. reverse transcriptase from avian myeloblastosis virus 

(AMV) in the presence of a primer such as calf thymus DNA 
hydrolysate. The mixture should contain appropriate 
deoxynucleoside triphosphates, wherein at least one of* 
said nucleosides is radioactively labeled, for example 

20. with 32p^ pqj. example, deoxycytidine 5'-(^-^P), 

deoxythymidine S'-i-^^P), deoxyadenine 5'-(^^P), or 
deoxyguanidine 5'-(^^P) triphosphates can be used as the 
radioactive nucleosides. After incubation, from 30 
minutes to 5 hours at 25**C-40"*C, extraction with chloro- 

25. form and phenol, and centrif ugation as well as chroma- 
tography, the radioactively labeled fractions are pooled, 
and constitute the cDNA probe. The radioactively labeled 
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cDNA probe containing conserved DNA information in 
substantially purified fomir i.e^r free of non-labeled 
molecules r free of cDNA which is complementary to other 
types of RNA, free of proteinaceous materials as well as 

5. free of cellular components such as membranes, organelles 
and the like, also constitutes an aspect of the present 
invention. A preferred probe is prokaryotic labelled 
cDNA/ most preferred being the bacterial labelled cDNA. 
The probe species can be any bacterial microorganism, 

10. such as those of the family Enterobacteriaceae , Brucella, 
BacilluSf Pseudoynonas, Lactobacillus y Haemophilus, 
Mycobacterium , Vibrio, Neisseria, Bactroides and other 
anaerobic groups, Legionella, and the like* Although the 
prokaryotic examples in the present application are 

15. limited to the use of E. coli as a bacterial prokaryotic 
probe organism, this aspect of the invention is by no 
means limited to this microorganism. The use of cDNA in 
radioactively labeled form as the probe is preferred to 

20. the use of radioactively labeled KNA because DNA has 
greater stability during hybridization. 

It is important to recognize that the labeled cDNA. 
probe should be a faithful copy of the RNA, i.e. be one 
wherein all nucleotide sequences of the template RHA are 

25. transcribed each time the synthesis is carried out. The 
use of a primer is essential in this respect. That the 
cDNA is a faithful copy can be demonstrated by the fact 
that it should have two properties following hybrid- 
ization: 
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1. The cDNA should protect 100% of labeled RNA from 
ribonuclease digestion; and 

2. The labeled 'CDNA should specifically anneal to 
the RNA as shown by resistance to SI nuclease. 

5, Beljanski M> et al , C.R. Acad. Sc Paris t 286/ 

Serie D. p. 1825-1828 (1978 )r described % radioactively 
labeled cDNA derived from E. coli rRHA. The cDNA in this 
work was not prepared with reverse transcriptase in the 
presence of a primer as in the present invention/ but was 
10. prepared with a DNA polymerase i; using as a template 
rRNA which had been pre-cleaved using ribonuclease 
The rRNA digestion product (with RNAse ,of Beljanski 
et al has a different base ratio from the initial rRNA, 
indicating a loss of bases and/or loss of short frag- 
15^ ments. Thus the cDNA obtained therefrom is not a faith- 
ful copy. In addition, the use of DNA polymerase I used 
by Beljanski is known to favor predominance of homo- 
polymeric over heteropolymeric transcription of rRl^A (see 
Sarin, P. S. et al , Biochem . Biophys . Res. Comm . , 59 t 202- 
20. 214 (1974)). 

In sum, the probe can be seen as being derived a) 
from genome DNA containing conserved sequences, e.g. 
genes, by cloning and/or nick-translation, b) from RNA 
itself or c) from cDHA by reverse transcription of RNA. 
25. Normally, the next step in the process of the inven- 

tion is the hybridization of the separated DNA digest 
from the unknown organism with the unlabeled or 
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(preferably) radioactively labeled RNA or DNA probe. 
Hybridization is carried out by contacting the paper 
containing covalently labeled DNA digest from the 
unknown, with a hybridization mix containing the probe. 

5. Incubation is carried out at elevated temperatures (50- 
7 0*^0) for long periods of time/ filter papers are then 
washed to remove unbound' radioactivity (if needed)/ air 
dried and readied for detection. An alternative/ highly 
preferred hybridization/ which is much more rapid than . 

lO. the one described above/ is the room temperature phenol 
emulsion reassociation technique of Kohne, D* E. et al/ 
Biochemistry/ 16:5329-5341 (1977)/ which is herein incor- 
porated by reference. 

After hybridization/ the technique requires selec- 

15. tive detection of the appropriately hybridized frag- 
ments. This detection can be carried out by taking 
advantage of the double strandedness of the hybridized 
fragments and using a selective method therefor (for non- 
labeled probe), or by autoradiography or by an appro- 
20. priate radiation scanner which may or may not be 
computerized/ and which may increase the speed of 
detection (for labeled probe). These techniques are well 
known to those skilled ^in the art and will not be further 
described at this point. 
25 • The end product of . the technique is an identifying 

genetic characterization/ such as a chromatographic band 
pattern having peaks and troughs, or preferably, light 
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and dark regions of various intensities, at specific 
locations. These locations can be readily matched to 
specific fragments sizes (in kilobase pairs) by intro- 
duction into the separation technique of a marker, such 

5. as EcoR I digested X bacteriophage DNA, In this manner, 
both the relative position of the bands to each other, as 
veil as the absolute size of each band can be readily 
ascertained. The identifying genetic characterization 
for the unknown is then comparied with characterizations 

10. present in a catalog or library. The catalog or library 
can consist of a book containing characterizations for at 
least two, and up to a virtually unlimited number of de- 
fined different organisms genera and species. For 
example, the number of pathologically relevant bacteria 

15. that cause human disease is estimated to be about 100, so 
it is estimated that a standard catalog of pathogenic 
bacteria would contain anywhere between 50 and 150 such 
characterizations. A catalog of types of bacterial 
strains for epidemiological typing systems can also be 

20, included. 

The characterizations will depend on the type or 
types of endonuclease enzymes selected, possibly on the 
particular organism used as the source for the radio- 
actively labeled probe (the probe organism), and on the 

25 • composition of the conserved DNA sequence information 
nucleic acids utilized to prepare the probe (e.g. 
containing either prokaryotic rRNA 5S, 16S or 23S 
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subtypes, or only 16S and 23S/ or consensus sequences or 
the like). Thus, the catalog may, for each probe, 
contain a variety of enzyme-specific characterizations, 
with the. size of each band listed, and with the relative 
5* intensity noted. As the concentration of the bound DNA 
boiind to the filter decreases , only the most intense 
bands can be seen, and the size of this band or bands can 
thus identify species. Any variation or permutation of 
the above can of course be used for the library. 
10- Additionally, for a eukaryotic organism the library may 
contain patterns . that result from the use of one type of 
DNA or any combination of organelle and/or nuclear DNA. 
The pattern for each DNA digest will depend on the probe 
composition. The catalog may be arranged so that, if more 
15. than one strain or species is present in the extracted 
sample and detected by the probe, the resulting 
characterization can be interpreted. 

A user can either compare the obtained characteriza- 
tion, e.g., band pattern, visually, or by aid of a one- 
20- dimensional, computer assisted, digital scanner program- 
med for recognition of patterns. These computer scanners 
are well known in the art of the time-of-sale transac- 
tions (the commonly utilized "supermarket" check-out bar 
code or pattern readers). Ideally, the library or catalog 
25, is present in a computer memory both in terms of the 

relative characterizations for a plurality of organisms, 
and in terms of the absolute values of molecular weight 
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or size of the fragments. The catalog comparison then 
consists of matching the unknown characterization with 
one of the characterizations present in the library by 
means of either one or both of the stored information 

5- elements (relative characterizations and /or absolute size 
elements). The intensity of each band when compared to a 
standard can also reveal the amount of bound DNA hybrid- 
ized/ and thus can be used to estimate the extent of the 
presence of an organism^ for example a prokaryote in a 

10* eukaryote. 

If a user wishes to further confirm the nature and 
identification of a given organism, such user can digest 
the unknown with a second, different endonuclease, and 
compare the resulting characterization to catalog. 

15.. characterizations of the organism for the second chosen 
endonuclease. This process can be repeated as many times 
as necessary to get an accurate identification* 
Normally, however, a single analysis with a single probe 
would be sufficient in most instances. 

20. The present invention and its variations can be used 

for a myriad of applications. It may be used by plant or 
animal breeders to correctly identify their subjects, or 
it may be used by clinical and microbiological labor- 
atories to identify bacteria, parasites or fungi present 

25. in any medium, including in eukaryotic cells. In this 
latter use, the method is preferred to the standard 
microbiological assays, since it does not require 
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microbiological assays, since it does not require 
isolation and growth of the microbes* In vitro growth 
and characterization is now either impossible for some 
microorganisms such as Mycobacterium leprae (agent of 

5. leprosy )r impossible on standard media for some micro-r 
organisms such as the obligate intracellular bacteria 
(e.g. rickettsia, chlamydia, etc), or highly dangerous 
(e.g. B anthracis (agent of anthrax)). The present 
method depends on the isolation of nucleic acid and 

10. avoids these problems since it avoids conventional 

bacterial isolation and characterization. The method is 
expected to detect microorganisms that have not yet been 
conventionally described. In addition, the present 
method allows distinguishing different strains of 

15. species, and this can be useful, for example, for epi- 
demiological typing in bacteriology. The method can be 
used by forensic laboratories to correctly and unaralDig- 
uously identify plant or animal tissues in criminal 
investigations. It can also be used by entomologists to 

20. quickly identify insect species, when ascertaining the 
nature of crop infestations. 

In addition, upon the conjunction of the method with 
the identification of inf rasubspecif ic taxons (such as 
e.g., nitrogenase genes in plant roots, see Hennecke, H. 

25. 291 Nature 354 (1981)), the methodology can be utilized 
to search for and identify the genotypes of individual 
strains. 
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The method of this invention is preferably used for 
the identification of microorganisms wherever they may be 
found. These microorganisms may be found in physio- 
logical as well as non-physiological materials. They may 

5. be found in industrial growth media, culture broths f or 
the like^ and may be concentrated for example by centri- 
fugation. Preferably, the microorganisms are found in 
physiological media, most preferably they are found in 
animal sources infected therewith. In this latter 

10. embodiment/ the method is used to diagnose bacterial 
infections in animals, most preferably in humans. The 
detection and identification of bacterial DNA with a 
prokaryotic probe is highly selective and occurs without 
hindrance, even in the presence of animal, (e.g., mam- 

15. malian) DNA. If a prokaryotic probe is used, conditions 
can be selected which minimize hybridization with mito- 
chondrial DNA, or mitochondrial bands can be subtracted 
from the pattern. The t-echnique can thus be used in 
clinical laboratories, bacterial depositories, industrial 

20. fermentation laboratories, and the like. 

Of particular interest is the possibility of detect- 
ing, in addition to the species and strain identity of 
the infecting microorganism, the presence in the micro- 
organism of any specific genetic sequences. For example, 
25. it is possible to detect the presence, of antibiotic 
resistance sequences found on R factors, which are 
transmissible plasraids mediating drug resistance. One 
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can add labeled R-f actor DNA or cloned labeled antibiotic 
resistance sequences to the hybridization mixture in 
order to correctly determine the antibiotic resistance of 
the organism, (an extra band or bands would appear) / or 
5* one can rehybridize the once hybridized filter in the 

presence of added antibiotic resistance sequence probe or 
probes. Alternatively one could separate the unknown DNA. 
into aliquots, and test the first aliquot for identifica- 
tion, the second for the presence of drug resistance 
10. sequences, the third for toxin genes, etc. Taterna- 
tively, one could use conserved gene information 
containing probe labeled with one radionuclide (e.g. P) 
in a hybridization mixture with added R-f actor probe 
labeled with a different radionuclide (e.g. -^H or -^^C). 
15. After hybridization, the presence of R-f actor DNA in the 
unknown DNA can be tested by scanning with two different 
scanners: one for species and strain identification 
(e.g. ^^P), the other for drug resistance, or the like 
(e.g. or ^"^C). In this manner the lab can, without 
20. isolating and characterizing the microorganism, identify 
the genus and species, type the strain and test for drug 
resistance, possible toxin production or any other 
character or taxon below the rank of species that can be 
detected with a labeled nucleic acid sequence or probe, 
25. all in one experiment. 

The R-factors are universal and cross species 
boundaries, so that identification can be carried out in 



•51- 

0120658 

any bacterial genus or species with the same R-factor 
probe (see Tomkins, L. S, et al ,. J, Inf> Dis, , 141 :625- 
636 (1981) ). 

In addition, the presence of viruses or virus- 

5. related sequences in eukaryotes or prokaryotes can also 

be detected and identified in coHjunction with the method 
of the inventions Any of the viruses described in 
"Manual of Clinical Microbiology", 3d edition, edited by 
Lennette, E.H., Amer> Soc> Microb,, 1980, 774-778 can be 

10, identified, e.g., picornaviridae, caliciviridae, 
reoviridae, togaviridae, orthomyxoviridae , 
paramyxoviridae, rhabdoviridae, retroviridae, 
arenaviridae, coronaviridae, bunyaviridae, parvoviridae, 
papovaviridae, adenoviridae, herjpesviridae , vidoviridae 

15^ and poxviridae. : 

A) \ihen the viral genome is integrated into host DNA 
(as with DNA viruses, for example members of 
Papovaviridae, and RNA viruses, for example, members of 
Retroviridae), high molecular weight DNA is extracted 

20. from the tissue and digested with restriction endo- 

nucleases. The overall procedure is the same as used fpr 
bacteria. The choice of a viral probe again depends on 
the question asked, and on the extent of homology between 
the "probe virus" and the viral related sequences to be 

25, detected. In order to have suitable sequence homology, 
it may be necessary that the probe and tissue sequences 
are related to the same family, genus, or species of 
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virus. In addition to the extent of conserved sequences, 
whether or not a viral probe hybridizes to viral related 
sequences in host DHA may be determined by the hybrid- 
ization conditions, which can be stringent or relaxed. 

5, The result of the hybridization will be a band or a 

pattern of bands showing that there are viral sequences 
incorporated into the host DNA. This information may be 
useful in helping to predict the occurrence of cancer. 
The probe can be any labelled complementary nucleic acid 

10. probe including cloned viral sequences. For KNA viruses, 
for example viral RNA can be used to make a DNA with 
reverse transcriptase; for DNA viruses, for example, 
viral DNA labelled by nick translation can be used. 
Again multiple probes can be used,' especially with 

15. different labels. - 

Same general features apply equally to DNA and El^A 
viruses. Viral genomes are relatively small, so the pre- 
cipitated nucleic acid is preferably collected by cen- 
trifugation; all of the procedures can use the total 

20* nucleic acid or the various procedures can be run 

separately. It is expected that viral nucleic acid can 
be concentrated by spooling cellular DNA to remove it 
before centrif ugation. This can also be used to deter- 
mine if the viral genome is integrated. 

25. For the viral probe to hybridize , it may be neces- 

sary and at least most preferred that the probe be from 
the same family, genus, or species as the unknown. 
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Reaction conditions^ stringent or relaxed, may determine 
whether or not a given probe hybridizes a distantly 
related genome. The probe may be cloned viral sequences 
that are labeled, or may be the complete genome or a 
5. portion of it. 

The technique described by Southern , supra is useful 
for the transfer of large DNA fragments (greater than 
about 0*5 kilobases) to nitrocellulose paper after alkali 
denaturation-. This technique might be useful for DNA . 

10. viruses but not for RNA viruses. RNA has been trans- 
ferred and covalently coupled to activated cellulose 
paper (diazobenzyloxymethyl-paper ) , and this can be used 
for KNA viruses. The modification of the Southern 
technique by Thomas (Thomas, P., Proc, Nat> Acad. Sci., 

15. USA, 21-5201-5205 (1980)) can be used for the efficient 
transfer of RNA, and small DNA fragments to nitrocel- 
lulose paper for hybridization. RNA and small DNA 
fragments are denatured with glyoxal and dimethyl sul- 
foxide, and electrophoresed in agarose gel. This 

20. procedure transfers DNA fragments between 100 and 2000 

nucleotides and Rl^lA efficiently, and they are retained on 
the nitrocellulose paper during hybridization. This is 
useful for small ribosomal DNA fragments as well. So it 
is most preferred to divide the restriction-enzyme 

25. digested specimen and denature the nucleic acid in one 

portion with glyoxal. The Southern and Thomas procedures 
would yield a maximum amount of information. 
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B) For DNA viruses, restriction analysis can be 
carried out with double-stranded (DS) viral DNA's to 
identify viruses present. Single-stranded (SS) DNA 
viruses will have different genome lengths. The probe 

5^ (the sequence information could be converted to DS DNA) 
that hybridizes, the hybridized fragment pattern and/or 
the -sizes or size can be used to identify viruses. Kiere. 
are again a number of ways to obtain complementary nu- 
cleic acid probes. For example, for DS DNA niclc- 

10. translation can be used; for SS DNA, DNA polymerase can 
be used to synthesize a cDNA. 

C) For Rl^A viruses, RNA is not digested by restric- 
tion endonucleases (the sequence information could be 
converted to DS DNA). The genomes of different RNA 

15. viruses are of different sizes, and some RNA viruses have 
more than 1 molecule in their genome. This, along with 
the base sequences detected by certain probes or pooled 
probes allows the RNA viruses to be identified. An 
example of a probe would be cDNA synthesized using viral 

20. RNA. 

When searching for infectious agents in specimens it 
is possible to search directly by extracting nucleic acid 
from the specimen, or by culturing first in media or 
cells to increase the number of agents, or by using a 
25. concentration step such as centrif ugation, or by trying 
all approaches. 
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The present invention lends itself readily to the 
preparation of "kits" containing the elements necessary 
to carry out the process. Such a kit may comprise a 
carrier being compartmentalized to receive in close con- 

5. finement therein one or more container means, such as 

tubes or vials. One of said container means may contain 
unlabeled or detectably labeled nucleic acid probe , such 
as for example the radioactively labeled cDNA to RNA from 
the organism probe, (preferably prokaryotic cDNA in the 

10. case of a kit to identify bacteria). The labeled nucleic 
acid probe may be present in lyophilized form, or in an 
appropriate buffer as necessary. One or more container 
means may contain one or more endonuclease enzymes to be 
utilized in digesting the DNA from the unknown organ- 

15. ism. These enzymes may be present by themselves or in 
admixtures, in lyophilized form or in appropriate 
buffers. Ideally, the enzymes utilized in the kit are 
those for which corresponding catalogs have been pre- * 
pared. Nothing stops the user, however, from preparing 

20. his or her own comparative standard at the moment of 

experiment. Thus, if a user suspects that an unknown is 
in fact of a given genus or species, he or she may pre- 
pare the identifying characteristics for the known and 
compare it with the characterization for the unknown. 

25. The kit may thus also contain all of the elements neces- 
sary, in order to carry out this sub-process. These 
elements may include one or more known organisms, (such 
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as bacteria), or isolated DNA from known organisms. In 
addition^ the kit may also contain a "catalog", defined 
broadly as a booklet/ or book, or pamphlet, or computer 
tape or disk, or computer access number, or the like, 
5. having the identifying characterizations for a variety of 
organisms of a certain group, such as plant species, 
mammal species, microbe species, especially patho- 
logically important bacteria, insect species or the 
like. In this mode, a user would only need to prepare 
10. the characterization for the unknown organism, and then 
visually (or by computer) compare the obtained 
characterization with the characterizations in the 
catalog. The kit may also contain in one container probe 
RNA for probe synthesis, in another container radio- 
15. labeled deoxyribonucleoside triphosphate, and in another 
container primer. In this manner the user can prepare 
his or her own probe cDNA. 

Finally, the kit may contain all of the additional 
elements necessary to carry out the technique of the 
20. invention, such as buffers, growth media, enzymes, 

pipettes, plates, nucleic acids, nucleoside triphos- 
phates, filter paper, gel materials, transfer materials, 
autoradiography supplies, and the like. It may also 
contain antibiotic resistance sequence probes, viral 
25. probes, or other specific character probes. 

Having now generally described this invention, the * 
same will be better understood by reference to certain 



-57- 



01 20658 



reference experiments and specific examples which are 
included herein for purposes of illustration only and 
are not intended to be limiting of the invention, 
unless specified. 
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MATERIALS AND METHODS 

A. Bacterial 

Extraction of High Molecular Weight DNA 

Bacterial broth cultures were centrifuged and the 
cells were washed with cold saline. The cells were 
5. suspended in a volume measured in ml of extraction buffer 
(0.15 M sodium chloride, 0.1 M EDTA, 0.03 M tris pH B.5) . 
approximately 10 times the gram weight of the packed 
cells. Lysozyme at 10 mg/ml was added to 0.5 mg/ml final 
concentration. The suspension was incubated at 37 ^^C for 
10. 30 minutes. Cell disruption was completed by the 

addition of 25% SDS to 2.5% final concentration, and 
raising the temperature to 60^C for 10 minutes. After 
cooling in a tap water bath, mercaptoethanol was added to 
X5. 1% final concentration. Pronase® at 20 mg/ml in 0.02 M 
tris pH 7.4 was predigested at 37*^0 for 2 hours and then 
added to 1 mg/ml final concentration. The solution was 
incubated at 37*^0 for 18 hours. Phenol was prepared by 
mixing one liter redistilled phenol, 2.5 liters double 
20. distilled water, 270 ml saturated Tris base, 12 ml mer- 
captoethanol, and EDTA to lO^^M final concentration and 
allowing the mixture to separate at 4**C. The phenol was 
washed with wash buffer (10"^M sodium chloride, 10"% 
EDTA, lOmM tris pH 8.5). Then an equal volume of fresh 
25, buffer was added. Mercaptoethanol was added to 0.1% 

final concentration. The solution was mixed and stored 
at 4**C. One half volume prepared phenol and one half 



-59- 



0120658 



volume chloroform was added to the lysed cell solution. 
This was shaken for approximately 10 minutes and cen- 
trifuged at 3;400 x g for 15 minutes. The aqueous phase 
was removed with an inverted 25 ml glass pipette. The 

5. extraction procedure was repeated until there was little 
precipitate at the interface. One-ninth volume 2 N 
sodium acetate pH 5.5 was added to the aqueous phase* 
Two times volume of 95* ethyl alcohol at -20**C was poured 
slowly down the side of the flasX. The end of a Pasteur 

10. pipette was melted close and used to spool the precipi- 
tated DNA. High molecular we'ight DNA was dissolved in 
buffer (10"'^EDTA, 10^h\ tris pH 7.4). The concentration 
of DNA was determined by absorbance at 260 nm using 30 
micrograms per absorbance unit as conversion factor, 

15. 

Restriction Endonuclease Digestion of DNA. 

- EcoR I restriction endonuclease reactions were 
performed in 0.1 M tris-HCl pH 7.5, 0.05 M NaCl, 0.005 M 
MgCl2f micrograms per ml bovine serum albumin. 

20. EcoR I reaction mixtures contained 5 units of enzyme per 
microgram of DNA, and were incubated four hours at 
37**C. PST I restriction endonuclease reactions were 
performed in 0.006 M tris-HCl pH 7.4, 0.05 M sodium 
chloride, 0.006 M magnesium chloride, .0.006 II 2- 

25. raercaptoethanol, and 100 micrograms per ml of bovine 

serum albumin. PST I reaction mixtures contained 2 units 
of enzyme per microgram DNA, and were incubated four 
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hours at 37**C. Usually 10 micrograms DNA was digested in 
a final volume of 40 microliters* Ten times concentra- 
tion buffers were added. Sterile distilled water was 
added depending on the volume of DNA. X Bacteriophage 
5. DNA was restricted with EcoR I to provide marker bands 
for fragment size determinations. Usually 2 micrograms 
X DNA was digested with 20 units EcoR I in a final volume 
of 20 microliters. 

10, Gel Electrophoresis and DNA Transfer. 

DNA digests were supplemented with glycerol, to 
about 20 %r and bromophenol blue tracking dye. In the 
case of X DNA digests, 20 microliters of Ix EcoR I buffer 
was added to each 20 microliter reaction mixture. 

15. Usually 15 microliters 75% glycerol and 5 microliters 
0.5% bromophenol blue were added to each 40 microliter 
reaction mixture. 

10 micrograms digested bacterial DNA or 2 micrograms 
digested X DNA were loaded per well and overlaid with 

20. molten agarose. Digests were electrophoresed in 0.8% 

agarose with 0.02 M sodium acetate, 0.002 M EDTA, 0.018 M 
trie base, and 0.028 M tris HCl pH 8.05 at 35 V until the 
dye migrated 13 to 16 cm. Gels were then immersed in 
ethidium bromide (0.005 mg/ml) and placed on a UV-light 

25. box to visualize the X fragments, DNA was transferred to 
nitrocellulose filter paper by the method of Southern, 
supra. Gels were treated with denaturing solution (1.5 M 
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sodium chloride, 0,5 M sodium hydroxide) on a rocker 
table for 20 min. Denaturing solution was replaced with 
neutralization solution (3.0 M sodium chloride, 0.5 M 
tris HCl, pH 7.5), and after 40 minutes the gels were 

5. checked with pH paper. Following neutralization, the 
gels were treated with 6 x SSC buffer (SSC = 0.15 M 
sodium chloride, 0.015 M sodium citrate) for 10 
minutes. DNA fragments were transferred from the gel to 
the nitrocellulose paper by drawing 6 X SSC through the 

10. gel and nitrocellulose paper with a stack of paper tpwels 
for 15 hours. Falters were placed between two sheets of 
3 MM chromatography paper, wrapped in aluminum foil, 
shiny side out, and dried in a vacuum oven at 8Q°C for 4 
hours . 

15. 

Synthesis of ^^P ribosomal RNA Complementary DNA (-^^p 

rRNA cDWA). 
32 

P-labeled DNA complementary to E> coli R-13 23S 
and 16S ribosomal RNA was synthesized using reverse 

20. transcriptase from avian myeloblastosis virus (AMV) . The 
reaction mixture contained 5 microliters 0,2 H dithio- - 
threithol, 25 microliters 1- M tris pH 8.0, 8,3 micro- 
liters 3 M potassium chloride, 40 microliters 0*1 M 
magnesium chloride, 70 micrograms actinoraycin, 14 micro- 

25* liters 0.04 M dATP, 14 microliters 0.04 M dGDP, 14 

microliters 0.04 M dTTP and 96.7 microliters HjO. The 
following were added to a plastic tube: 137.5 microliters 
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reaction mixture, 15 microliters calf thymus primer (10 
mg/ml), 7 microliters H 20, 3 microliters rRNA {using 40 
micrograms /OD unit concentration, is 2 #76 
micrograms/microliters), 40 microliters deoxycitydine 5'- 
5. ("^^P) triphosphate (10 mCi/ml), and 13 microliters AIIV 
polymerase (6,900 units yl. The enzymatic reaction was 
incubated 1.5 hours at 37^C. Then the solution was ex- 
tracted in 5 ml each of chloroform and prepared phenol* 
After centrifugation (JS 13,600 RPM), the aqueous phase 

10. was layered directly on a Sephadex^ G-50 column (1.5 x 22 
cm) • A plastic 10 ml pipette was used for the column. A 
small glass bead was placed in the tip, rubber tubing 
with a pinch clamp was attached, and degassed G-50 
swelled in 0.05% SDS overnight was added. The aqueous 

15. phase wall allowed to flow directly into the G-50 and was 
then eluted with 0.05% SDS. 20 fractions at 0.5 ml each 
were collected in plastic vials. Tubes containing peak 

c 

fractions were detected by Cerenkov counting using a 
discriminator, counting for 0.1 min. per sample and re- 
20. cording total counts. Peak fractions were pooled. 

Aliquots were added to Aquesol® (commercially available), 
and the CPM of *^^P per ml was determined by scintillation 
counting. 
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Hybridization and Autoradiography > 

Fragments containing ribosomal RNA gene sequences 
were detected by autoradiography after hybridization of 
.the DNA on the filters to ^^P-rRI^A cDNA. Filters were 
5. soaked in hybridization mix (3 Xr SSC/ 0*1% SDS, 100 
micrograms/ml denatured and sonrcated canine DNA, and 
Deinhart's solution (0.2% each of bovine serum albumen, 
Ficollf and polyvinyl pyrrolidine)), for 1 hour at 
68**C. ^^P rRNA cDNA was added at 4 x 10^ CPM/ml, and the 
10. hybridization reaction was incubated at 68*^0 for 48 

hours. Filters were then washed in 3 x SSC, and 0.1% SDS 
at 15 min. intervals for 2 hours or until the wash solu- 
tion contained about 3,000 cpm ^^P per ml. Filters were 
air dried, wrapped in plastic wrap and autoradiographed 
15. approximately 1 hour with Kodak X-OMAT R film at -70**C, 
B . Mammalian experiments > ' Mas luusculus domesticus 
(mouse) rRNA probes were synthesized from IBS and 28S, 
and only 28S rRNA. Nucleic acid was extracted from mouse 
liver and precipitated. High mSlecular weight DNA was 
20. spooled and removed. The remaining nucleic acid was 

collected by centrif ugation and. dissolved in buffer, 50* 
mM MgCl2 and 100 ml'l Tris pH* 7.4. DNAse (RNAse free) was 
added to a concentration of 50pg/ml. The mixture was 
incubated at 37 "^C for 30 min. The resulting RNA was 
25. rextracted, ethanol precipitated, and dissolved in ImM 
sodium phosphate buffer pH 6*8 A 5 to 20% sucrose 
gradient in O.IM Tris pH 7.4 and O.OIM EDTA was pre- 
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pared • The sample was added and the gradients spun in an 
SW40 rotor 7 hr. at 35K RPM. Fractions were collected by 
optical density* The 18S and 28S fractions were selected 
by comparison to known molecular weight markers • 

5. For all of the mammalian experiments relaxed hybrid- 

f 

ization conditions were used, 54**C. „ The washing pro- 
cedure, .carried out at 54**C/ was 3 separate washes with 
BxSSC with 0.05% SDS for 15 min. each. 

Reference Experiments 1-8 describe experiments carried 
10. out with rRNA- information cont-aining probes. Examples 1 to 
3 describe computer simulations utilizing a histone gene 
information-containing probe, a tryptophan operon trp D 
gene-information containing probe, and an a-f etoprotein 
gene information-containing probe, respectively. 

15. 

Reference Experiment J. 
Bacterial Species are Defined by Restriction 
Endonuclease Analysis of Ribosomal RNA Genes 
The several strains of P. aeruginosa used in this 
20. example have the minimal phenotypic characters which 
identify the species {Hugh R.H., et al, in: Manual of 
Clinical Microbiology, 2d Ed. ASM, 1974, pp. 250-269), 
(Table 2). Strains of three other Tseudomonas and two. 
Ac i ne toba c te r species were selected to compare species 
25. and genera (Table 3). 
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TABLE 2 

Corresponding strain numbers of isolates with the 
minimal phenotypic characters of P» aeruginosa 
for the comparison of Strains • 



JRH 


ATCC 


151 


10752 


809 


7701 


810 


8689 


811 


8707 


812 


8709 


815 


10145 


1559 


14425 



15. Strains used for comparison of Pseudomonas and 
Acinetobacter species are listed in Table 3» 



20. 
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TABLE 3 

Corresponding strain numbers of type, neotype and 
reference strains for the comparison of 
species and genera 



Species 


RH 


ATCC 


NCTC 


Strain Status 


P. aeruginosa 


815 


10145 


10332 


type 


P. stutzeri 


2601 


17588 




neotype 


P. fluorescens 


818 


13525 


10038 


neotype 


P. putida 


827 


12633 




neotype 


A. anitratus 


2208 


19606 




type 


A. Iwoffii 


462 


7976 




reference 



Acinetobacter species were selected for comparison 
15. of genera because they share certain attributes with many 
Pseudomonas species. 

The sizes (kilobase pairs) of fragments in EcoR I 
digests are: stutzeri 16.0, 12-0, 9.4; P. fluorescens 
16.0, 10. Or 8.6, 7.8, 7.0; P. putida 24.0, 15.0, 10.0, 
20. 8.9; A. anitratus 20.0, 15.0, 12.5, 9.8, 7.8, 6.1, 5.2, 
4.8, 3.8, 2.8 (size of the smallest 3 fragments not cal- 
culated) ; A^J^raffii^ 12.0, 10.0, 9.1, 7.0, 6.4, 5.7, 5.5, 
5.3, 4.8, 4.4, 3.6, 3.2, 2.9 (size of the smallest 3 
fragments not calculated). The sizes (kilobase pairs) of 
25. fragments in PST I digests are? P. stutzeri 6.7, 6.1, 

5.5; P. fluorescens 10.0, 9.4, 7.8, 7.0; P. putida 10.5, 
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9.9, 6*8, 6.3, 4>4; A> anitratus 36.0, 28.0, 20.5, 12.0, 

10.0, 5.8, 3*7, 2.6, 2.4; h. Iwoffi 9.9, 8.7, 7.2, 5.7, 
4.0, 3*6, 3.2, 2.7. 

Comparison of the hybridized restriction fragments 
5. from the seven strains of P. aeruginosa leads to the 

conclusion that this species can be defined by an EcoR I 
specific set of fragments containing rRNA gene sequences, 

10.1, 9.4, 7.6, and 5.9 kilobase pairs (KBP) (FIGURE 
1). The 7.6 KBP EcoR I fragment occurs in 4 of the 7 . 

10. strains in this sample. An analogous situation occurs 
among certain phenotypic characters of strains of 
species. The fact that the EcoR I sets of fragments from 
the 7 strains can be used to separate the strains into 
two groups prompts speculation that there may be two 
15. species with the minimal phenotypic characters of P. 

aeruginosa . The results of experiments in which DNA was 
digested with PST I (FIGURE 2) lead to the conclusion 
that the strain variation shown by the EcoR I 7.6 KBP * 
fragment represents variation within the species, since 
20. there is a single conserved set of PST I fragments, 9.4, 
7.1, 6.6, and 6.4 KBP, that define the species. The 9. "4 
and 6.6 KBP Pst I fragments* occur in 6 of the 7 strains 
of P. aeruginosa; ' the 7.1 and 6.4 KBP PST I fragments 
occur in all of the strains sampled. PST I fragment 
25. variation occurs in strains that do not contain an EcoR I 
7.6 KBP fragment; RH 151 has 10.1 and 8.2 KBP fragments, 
RH 809 does not contain a 9.4 KBP fragment and has a 6.0 
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KBP fragment r and RH 815; the type strain, does not con- 
tain a 6*6 KBP fragment. The patterns of hybridized 
fragments support the conclusion that enzyme specifier 
conserved sets can be used to define species. Strains of 
5. a species probably have a majority of the fragments in 

the conserved set. The occurrence of fragment variations 
in some strains does not prevent identification and may 
prove useful in epidemiological studies. 

The occurrence of variation r EcoR I 7.6 KBP fragment 
10. in P> aeruginosa strains, may be put into perspective by 
examining hybridized EcoR I fragments found in the type 
strains of other Pseudomonas species (FIGURE 3). The 
type strains of P. stutzeri, P. fluorescens, and _P^ 
putida do not contain a 7.6 KBP fragment, but do have 
15. EcoR I fragments of the sane size in common; P. 

aeruginosa and P. stutzeri each have a 9.4 KBP fragment, 
p. stutzeri and P. fluorescens each have a 16 KBP frag- 
ment, and P. fluorescens and P. putida each have a 10 KBP 
fragment. In general, the sizes of the fragments are 
20. unique in the type strains, of each of the 4 Pseudomonas 
species; and the type strain of each species has a dif- 
ferent size range of fragments. These general comments 
are also true for the PST I digests (FIGURE 4). 

When the fragment patterns or one strain of each of 
25. the 4 Pseudomonas and 2 Acinetobacter species are com- 
pared, it can be concluded that the species of each genus 
are similar, but the genera differ. The 2 Acinetobacter 
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species have a greater range of hybridized fragment sizes 
than do the 4 Pseudoinonas species . 

Without the aid of restriction enzyme maps such as 
those availble for E. coli> Bacillus thuringiensis and B. 
subtiliSy it is not possible to predict where enzymes cut 
rRNA genes, the number of copies per genome, whether 
there are heterologous flanking regions between genes or . 
gene heterogeneity. The E« coli rRNA cDNA probe may fail 
to hybridize with some restriction fragments containing. 
rRNA gene sequences, and if so, this reflects the evolu- 
tionary distance- or diversity between the test organism 
and E, coli . The conserved nature of rRNA can be used to 
argue that this is not the case. However, this is a 
minor problem compared to the advantage of having a stan- 
dard probe that can be equally applied to any unknown 
species. 

Reference Experiment 2 
Comparison of Restriction Analysis with DNA-DNA 

Liquid Hybridization ; 

The strains used in this study are listed in Tables 

4 and 5. 
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Table 4 

Corresponding Strain Numbers 
of Neotype strains of B. subtilis 
and type strains of junior synonyms 

5. 









Strain 


Species 


RH 


ATCC 


Status 


B. subtilis 


3021 


6051 


neotype 


B. unif lagellatus 


2990 


15134 


type 


B. amyloliquaf aciens 


3061 


23350 


type 
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Table 5. Corresponding strain number of 
strains of B. subtilis 



RH 


NRRL 




ATCC 


3063 


B-354(NRS-231) 


- 


6633 


3064 


B-356(NRS-238) 


- 


7067 


3065 


NRS-265 




6455 


3066 


NRS-659 




7060 


3067 


NRS-730 




7003 


3068 


NRS-737 




943 


3069 


NRS-741 




4344 


3070 


NRS-773 




8188 


3071 


NRS-1106 




4944 


3072 


NRS-1107 




7480 


• High 


molecular weight DNA was 


isolated 


from each of 


the strains. Liquid DNA-DNA hybridization data was 


collected 


using RH 3021 and RH 2990 


labeled 


DNAs and 



20. results are shown in Table 6. 
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TABLE 6 

Ifercent hybridization between labeled 
DNA probe and ENA from strains of B. subtilis 



10. 



labeled mk 

probe m 3063 RH 3064 EH 3066 RH 3067 RH 3068 BH 3065 

m 3021 61 77 51 96 84 . 18 

m 2990 12 10 13 15 16 50 

RH 3069 RH 3070 RH 3071 RH 3072 RH 3021 RH 2990 
RH 3021 14 93 15 100 20 

PH 2990 100 — 17 100 20 100 



15. RH 3061 

RH 3021 11 
RH 2990 70 
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The data shows there are two hybridization groups. 
Similar data- is reported in the literature for B. 
subtilis { Seki et al^ International Journal of Systematic 
Bacteriology 25 ;258-270 (1975) )• These two groups can be 
5. represented by RH 3021 and RH 2990. When restriction 
endonuclease analysis of ribosomal RNA genes is carried 
out, EcoR I digests (FIGURE 5) can be separated into two 
groups. The group represented by RH 3021 has two in- 
tensely hybridized fragments (2.15 and 2.1 KBP) . The 
10. group represented by RH 2990 has two intensely hybrid- 
ized fragments (2.6 and 2.5 KBP). The EcoR I data can 
be used to place B. subtilis strains in appropriate DNA- 
DNA hybridization groups. According to the DNA-DNA 
hybridization 70% rule, B. subtilis is actually two 
15. species. However, when the PST I data (FIGURE 6); is 

considered, it is possible to think of the groups as two 
divergent populations related to a common ancestor or 
speciation event. The conclusion that B, subtilis is one 
species correlates with phenotypic data. The strains 
20. listed in Table 5 are identified as B. subtilis in 

Gordon, R. E. et al "The Genus Bacillus", Agriculture 
Handbook No. 427 , Agricultural Research Service, -U.S. 
Dept. of Agriculture, Washington, D.C. pp. 36-41. 
Restriction analysis can provide data which is comparable 
25. to DNA-DNA hybridization data, or by selecting the proper 
enzyme, restrction analysis can adequately define species 
despite divergence, RH 3061 has lost PST I sites. How- 
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5. 



ever, the EcoR I data suggests that the strain is Bj_ 
subtilis . The same is concluded from the Bgl II data 
(FIGURE 7) and Sac I data (FIGURE 8). 

Reference Experiment 3 
aability of the Itestriction Analysis 
Kttern and Other 
Bacillus polvmyxa Eicperinents 



10. 



TABLE 7 

Ifeotype strains of B. subtilis and B. polyntyxa 



15. 



Sp&cxes 
B. subtilis 
B. polymyxa 
B. polymyxa 

B. polymyxa 
B. polymyxa 



RH 
3021. 
3074 
3033 

3062 
3073 



ACTT 
6051 
842 



NRRL 



NRS-1105 



20. 



Gsmnents 

neotype 

neotype 

asporogenous 
nutant derived 
from RH 3074 

neotype 

asporogenous 
nutant derived 
from NRS-1105 
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B> subtilis and B. polymyxa can be distinguished by EcoR 
I data (FIGURE 9), PST I data (FIGURE 10) Bgl II data 
(FIGURE 11, left) and Sac I data (FIGURE 11, right). It 
can be concluded from the major differences in the PST I 

5. band patterns that bacillus polymyxa is in the wrong 

genus. While both species produce spores, they are not 
phenotypi-cally similar. It is reassuring that the type 
strain of B. polymyxa from both culture collections, ATCC 
and NRRL have the same band patterns. The important 

10. data, however, is that the asporogenous mutants can be 

identified. It -is very difficult, perhaps impossible, to 
identify Bacillus species if they fail to form spores. 

Reference Experiment 4 
15. Identification of a Bacterial Species in House 

Tissue without Isolation 

A Swiss mouse, Mus musculus domesticus (inbred 
strain), v;as inoculated intraperitoneally with 0.5 ml of 
a turbid, suspension of Streptococcus pneumoniae RH 3077 
20. (ATCC 6303). When the mouse became moribund, the heart, 
lungs, and liver were removed. High molecular weight DNA 
was isolated from these tissues, S. pneumoniae RH 3077 
and Swis3S mouse organs, and the procedure for restriction 
endonuclease analysis of rRNA genes was carried out using 
25. EcoR I to digest the DNAs. In addition to washing the 
filters in 3 X SSC, they were washed for 2 X 15 minute 
periods in 0.3 X SSC and 0.05% SDS. Autoradiography was 
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carried out for 48 hr. The data (FIGURE 12) shows that 
S> pneumoniae can be defined by seven hybridized 
fragments (17.0, 8.0, 6.0, 4.0, 3.3, 2.6 and 1.8 KBP). 
The bacterial cDNA probe hybridizes poorly to tv70 mouse 
DNA fragments (14.0 and 6.8 KBP). Hybridized fragments 
signal the presence of S. pneumoniae in the infected 
tissues. All seven bands can be seen in the heart DNA 
extract. They are less intense in the liver DNA extract, 
but all can be seen in the autoradiograph . Only the 6.0 
KBP band appears in the lung DNA extract. The lesser 
number of bacteria in the lungs can be explained by the 
mouse having septicemia rather then pneumonia. The lungs 
showed no consolidation at autopsy. In order to deter- 
mine the sensitivity of the assay, bacterial DNA was 
diluted with mouse DNA and electrophoresed. All seven 
bands can be seen in the autoradiograph v;hen 0.1 micro- 
grams of bacterial DNA is used. The 17.0, 8,0 and 6.0 
KBP bands can be seen with 10""^ \xg of bacterial DNA. Tf 
the figure of 5 X lO""-^ pg DNA per 10^ S> pneumoniae cells 
is used (Biochim Biophys Acta , 26:68) , 10^-^ yg is 
equivalent to 2 X 10^ cells. The present technique is 
thus useful for diagnosing infections at this level of 
sensitivity. 

This reference experiment also demonstrates that .the 
bacterial probe hybridizes with mouse-specific EcoR I frag- 
ments (see FIGURE- 9, fragments having 14.0 and 6.8 KBP). 
These gragments correspond, to EcoR I fragments detected by 
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mouse 18S and 28S ribosomal RNA probe (FIGURE 14 infra 
shows that the 6»8 KBP fragment contains the 28S rRNA 
sequences). The bacterial probe does not hybridize as 
well to mammalian ribosomal RNA gene sequences, so the 

5. bands are less .intense, the system of bacterial probe and 
nuclear mammalian DNA is less sensitive, and selectivity 
for DNA from infecting prokaryotes is clearly demon- 
strated. In experiments where bacterial probe was 
hybridized to 10 pg digested bacterial DNA per lane, no. 

lO. hybridization to 10 pg digested human or mouse DNA per 
lane was detected on the autoradiographs when the 
bacterial bands were clearly seen. 

Reference Experiments 5-8 

15. Mammalian experiments 

These reference experiments illustrate that the con- 
cept of rRNA restriction analysis to identify organisms 
can be successful!;^ applied not oj>ly -to bacteria but to - 
complex eukaryotic organisms. 

20. FIGURE 13 shows that mammelian genera can be 

recognized . with Mus musculus domesticus 18S and 28S rRNA 
probe, and that several species of Mus can be distin- 
guished. In this figure, the enzyme is PST I and the 
subjects and corresponding bands are as follows: 

25. 1, Mus nusculus melossinus (mouse) 14. 5# 13. 5^ 2.6 

2. Mus musculus domesticus (mouse) 13.5, 2.6 

3, Canis f amiliaris (dog) 12.0 
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^- Casria porcellus Cq^ir^a pig^ 11. Of 14*0| 13»0, 

8.8, 5.7, 4.7^ and one 
band less than 3.0 

5. Cricetulus griseus (hanster) 25. Or 4.7 

6. Homo sapiens (human) 15. Of 5.7 

7. Bslis catus (cat) 20. 0^ 9.7 

8. Ratus norvegicus (rat) 12.5 

9. Mas nusculus dcmestiois (mouse) 13.5^ 2.6 

10. Mus ceirvicolor cervicolor (mouse) 14.0, 2.7 

11. Mus oervicolor papeus (mouse) 13.5, 2.6 

12. Mus pahari (mouse) 13.0, 3.7 

13. Mus codkii (mouse) 13.5, 2.6 

FIGURE 14 shows that mouse and cat DNA can be 
distinguished by the 28S rRNA cDNA alone, and that the 
pattern of hybridized bands is dependent on the 
composition of the probe sequences. In Figure 14 the 
enzyme is EcoR I, and the subjects and bands are as 
follows: 

1. Mus musculus domesticus (mouse) 6.8 KBP 
2- Felis catus (cat) 8.3 KBP 

In Figure 15 the enzyme is Sac I, and the subjects 
and bands are as follows: * 

1. Erythrocebus patas (patas monkey) 8.5, 3.7, <3.0 

2. Ifeitus norvegicus (rat) 25.0, 9,5, 3.6, <3.0 

3. Mus musculus domesticus (mDUse) 6.8, <3.0 

4. Felis catus (cat) 9.5, 5.3, 4.0, <3.0, <3.0 

5. Itomo sapiens (human) 10.5, <3.0 
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6. riacaca imilatta (rhesus monkey) 9.8, <3.0 When Figure 15 
(Sac I digests) is compared to the other maramalien 
figures it can be seen the the hybridized pattern is 
enzyme specific. 

5. Figure 16 shows that primates can be distinguish- 

ed. Cell cultures have bands in common with tissue from 
the species of origin, and different human cell cultures 
can be distinguished by additions and deletions of 
bands. In this figure, the enzyme is EcoR I, and the 

10. subjects and bands are as follows: 

1. Ecytlirocebus patas (patas nonkey) >22.0, 11.0, 

7.6, 2.6 

2. tecaca inulatta (rhesus monkey) 22.0, 11.5, 7.6 

15. 

3. Hditd sapiens (human) >22,0, 22.0, 16.0, 8.1, 6.6 

. 4. M 241/88 (langur monkey cell culture) 14.0, 7.2, 5.7 
20. 5. HeLa (human cell culture) >8,1, 6.6 

6. J96( human cell culture) > 22.0, 22.0, 16.0, 11.0, 8.1, • 

6.6 

25. 7. AO . (human cell culture) 22.0, 16.0, .8.1, 6.6 

8. X-381 (rhesus monkey) 22.0,* 11.5, 7.6 
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Example 1 
Use of an H4 His tone Gene Probe 



A computer simulation of the identification and 
characterization of two animal species (sea urchin and 
mouse) was carried out using a conserved DNA sequence 
derived from the H4 his tone gene* 

The histone H4 gene sequence for sea urchin 
(Psammechinus miliaris) is shown below ^ where A, T, C, G, 
represent the known nucleotides ^ and N represent a 
presently unknown position (788 base pairs)* 



10 20 30 40 50 

CAACATATTA GABGAAGGGA GAGA6AGAGA GAGAGA6AGA 6AGA6AGAGA 
GTTGTATAAT CTCCTTCCCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT 

60 70 80 90 100 

GGGGGGGBGG GAGGGAGAAT TGCCCAAAAC ACTGTAAATG TAGCGTTAAT 
CCCCCCCCCC CTCCCTCTTA ACBGBTTTTG TGACATTTAC ATCGCAATTA 

110 120 130 "140 150 

GAACTTTTCA TCTCATC6AC TGCGCGTGTA TAAGGATGAT TATAAGCTTT 
CTTGAAAAGT A6AGTAGCTG AC6C6CACAT ATTCCTACTA ATATTC6AAA 

160 170 180 190 . 200 

TTTTCAATTT ACAG6CACTA CGTTACATTC AAATCCAATC AATCATTTBA 
AAAA6TTAAA TGTCCGTBAT GCAATGTAAG TTTA66TTA6 TTA6TAAACT 

210 220 230 240 250 

ATCACCGTCG CAAAA6GCAG ATGTAAACTG TCAAGTTGTC AGATTGTGTG 
TAGTGGCA6C GTTTTCCGTC TACATTTGAC AGTTCAACAG TCTAACACAC 

260 270 280 290 300 

CGCGGCCTCC AGTGAGCTAC CCACCGGGCC GTCGCGGAGQ. 66C6CACCTG 
GC6CC6GA6G TCACTCGATG GBTGGCCCGG CAGCGCCTCa CCGCGTGGAC 

310 320 330 340 350 

TGCGGGAGGG GTCATCGGAG BGCGATCGAG CCTCGTCATC CAAGTCCGCA 
ACGCCCTCCC CAGTAGCCTC CCGCTAGCTC GGAGCAGTAtr GTTCAGGCGT 

360 * . 370 380 390 400 

TAC6GGTGAC AATACCCCCG CTCACC6GGA GGGTTGGTCA ATCGCTCAGC 
ATGCCCACT6 TTATGGBBCC GA6TGGCCCT CCCAACCAGT TAGCGAGTCG 
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410 420 430 440 450 

6AAACGTCCA GTC6TCAGCA TC6CACTAAG ACTCTCTCTC AATCTCCATA 

CTTTGCAG6T CAGCA6TC6T AGCGTGATTC T6AGAGABAG TTAGAGGTAT 

460 470 480 490 500 

ATGTCAGGCC 6TGGTAAA66 AGGCAA6GGG CTCGGAAAG6 GAGGCGCCAA 

TACA6TCC6G CACCATTTCC TCCGTTCCCC GA6CCTTTCC CTCC6CBGTT 

5 510 520 530 540 550 

• GCGTCATCGC AAGGTCCTAC GAGACAACAT CCA6GGCATC ACCAAGCCTG 

CGCAGTABC6 TTCCA6GATG CTCTGTTGTA GGTCCC6TAG T6GTTC66AC 

• 

560 570 580 590 600 

CAATCCGCCG ACTCNNNNNN NNNNNNNNNN NNNNNNGAAT CTCTGGTCTT 
GTTA6GCGGC T6A6NNNNNN NNNNNNNNNN NNNNNNCTTA GAGACCAGAA 

610 620 630 640 650 

ATCTACGA6G AGACACGAGG 6GTGCT6AAG GNNNNNNNNN NNNNNNNNNN 

10-TAGATGCTCC TCTGTGCTCC CCACGACTTC CNNNNNNNNN NNNNNNNNNN 

660 670 680 690 700 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

710 720 730 740 750 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NN6GCC6AAC ACTGTACBGC 

NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNCCGGCTTB TGACATBCC6 



15. 



760 ■ 770 780 

TTCGGCGGCT AAGTGAAGCA GACTTGGCTA GAATAACG 
AAGCCGCCGA TTCACTTCGT CT6AACCGAT CTTATT6C 
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The analogous mouse H4 gene sequence is shown below 
(968 base pairs): 



10 20 30 40 50 

GAATTCTCCe AGG6ACTTCG GCACCATAAT TAA6AAAATC GAAAATAAAA 
5.CTTAAGAGGC TCCCTGAAGC CGT66TATTA ATTCTTTTA6 CTTTTATTTT 

60 ^ 70 80 90 100 

AAATAAAGGC TTfiAGACIGT AA6GAACC66 TAGAGGGCAG A6AAGAGAAA 
TTTATTTCCG AA€TCTGACA TTCCTT66CC ATCTCCCGTC TCTTCTCTTT 

110. 120 130 * 140 150 

AGAAAAACAG GAAGATGATG CAACATCCA6 AGCCC66ATA ATTTAGAAAG 
TCTTTTTGTC CTTCTACTAC GTT6TAGGTC TCGGGCCTAT TAAATCTTTC 

160 170 ISO 190 200 

6TTCCCGCCC GCGCGCTTTC A6TTTTCAAT CTGGTCC6AT CCTCTCATAT 
CAAGG6CGG6 C6CGCGAAAG TCAAAAGTTA GACCAG6CTA 66AGAGTATA 

210 220 230 240 250 

ATTAGTGGCA CTCCACCTCC AATGCCTCAC CAGCT6GTGT T.TCA6ATTAC 
TAATCACCGT GAGGTG6AGG TTACGGAGTG GTCGACCACA AA6TCTAATG 

260 270 280 290 300 

ATTAGCTATG TCT6GCAGAG 6AAAGGGTGG AAAGGGTCTA G6CAA6GGTG 
TAATC6ATAC AGACCGTCTC CTTTCCCACC TTTCCCA6AT CCGTTCCCAC 
15. : 

310 320 330 340 350 

6CGCCAAGCG CCATCGCAAA 6TCTT6CGT6 ACAACATCCA G6GTATCACC 
CGCGGTTCGC 6BTAGCGTTT CAGAAC6CAC TGTTGTAGGT CCCATAGTGG 

360 370 380 390 400 

AAGCCC6CCA TCCGCCGCCT GGCTCGGCGC 6GTGGGGTCA AGCGCATCTC 
TTC6GGC6GT AGGCGGCGGA CCGA6CCGCG CCACCCCAGT TCGCGTAGAG 

410 ■ 420 430 440 450 

20. CGGCCTCATC TACGA6GAGA CCCGTGGTGT GCTGAAGGTB TTCCTGGAGA 
6CCGGA6TA6 ATGCTCCTCT 6GGCACCACA CGACTTCCAC AAGGACCTCT 

460 470 480 490 500 

ACGTCATCCG CGACGCAGTC ACCTACACCG AGCAC6CCAA 6CGCAAGACC 
TGCAGTAGGC GCTGCGTCAG T6GATGTGGC TC6TGCGGTT CGC6TTCTG6 

510 \" 520 530 540 550 

GTCACCGCTA TGGATGTGGT GTACGCTCTC AAGCGCCAGG 6CCGCACCCT 
25. CAGTGGCGAT ACCTACACCA CATGCGAGAG TTCGCGGTCC CGGCGTGG6A 

560 - 570 580 . 590 600 

CTACGGCTTC GGAGGCTAGA CGCCGCCGCT T.CAATTCCCC CCCCCCCCCC 
6ATGCCGAAG CCTCCGATCT GCGGCGGCGA AGTTAAGGGG G66GG6GGBG 
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610 620 630 640 650 

ATCCCTAAC6 GCCCTTTTTA G6GCCAACCA CAGTCTCTTC A6BAGAGCTG 
TAG6GATTGC CG66AAAAAT CCCGGTTG6T 6TCA6A6AAG TCCTCTC6AC 



660 670 680 690 700 

ACACTGACTT GG6TCGTACA GGTAATAACC GCGGGTTTAG 6ACTCACGCT 

TGTGACTGAA CCCAGCATGT CCATTATTG6 CGCCCAAATC CTGAGT6CGA 

710 720 730 740 750 

• ACTAG6TGTT CCGCTTTTAG AGCCATCCAC TTAAGTTTCT ATACCACGBC 

TGATCCACAA GGCGAAAATC TCGGTAGGTG AATTCAAAGA TATGGTGCCG 



760 770 780 790 800 

GGATAGATAG CATCCAGCAG GGTCTGCTCA CACTG6GAAT TTTAATTCCT 

CCTATCTATC GTAGGTCGTC CCAGACBA6T GTGACCCTTA AAATTAAGGA 

• 810 820 830 840 850 

, Q ACTTAGGBT6 -TGAGCTGGTT- 6TCAG6TCAA BAGACTGGCT-AAGATTTTCT- - 

■*■ * TGAATCCCAC ACTCGACCAA CAGTCCA6TT CTCTGACCGA TTCTAAAAGA 

860 870 880 890 900 

TTAGCTC6TT TGGAGCAGAA TTGCAATAAG GAGACCCTTT GGATGGGAT6 

AATCGAGCAA ACCTCGTCTT AAC6TTATTC CTCT66BAAA CCTACCCTAC 

910 920 930 940 950 

ACCTATGTCC ACACATCAAA TGGCTATGTG GCTGTGTCCC TGTGTTTCCA 
TGGATACA6G TGTGTA6TTT ACC6ATACAC C6ACACAGGG ACACAAAGGT 

15. 960 

ATGAGT6GCT GTGCTTGA 
TACTCAC.CGA CACGAACT 



0120658 

The region of homology for both aforementioned 
sequences is shown below, where asterisks denote not 
homologous portions. Within the region shown, the first 
118 base pairs have 80.5% homology and are used as a 
conserved DNA sequence probe in this example (sea urchin 
(top) base positions 449 to 567 , mouse (bottom) base 
positions 257 to 375): 



449 TAATGTCAGGCCGTGeTAAAGG-AGBCAA-GGBGCTCGG-AAAGGGAGGCGCCAAGCGTC 
257 TA-TGTCTBBCAGA66-AAAGGGTGG-AAAG6BTCTAGGCAA-666TB6C6CCAA6CBCC 



ATC6CAAGGTCCTACGAGACAACATCCAG6GCATCACCAAGCCTGCAATCCGCCGACT 
ATC6CAAAGTCTT6CGTGACAACATCCA6GBTATCACCAAGCCCBCCATCCGCCBCC7 



-C 
G6 



^ ^ ^ ^ 

NNW>IHNNNNNNNNNNNNNNNNN6AATCTCTGGTCTT-ATCTACGAGGAGACACGAGGG-G 
CTCGBCBCGGTGGGGTCAABCG-CATCTCCGG-CCTCATCTACGAGGAGACCCS-TBBTB 



TGCTGAABGNKMNNNNNNNNNNNNNNNNNNNNNNNNMNNNNhJNNbJNNNNNHNNNNNNNNN 
TGCTGAAG6TGTTCCTGGA6AACGTCATCCGCGACGCAGTCACCTACACCGAGCACBCCA 

jjc ^ ^ 

f^JNNNNNNNNNNKNNNNNNNNNNNNNNNNNNNNNNKNNNNNNMNNNNNNNMGGCCG-AAC- 
A6CGCAAGACC6TCACCGCTAT6GATGTGGTGTACGCTGTCAAGC6CCAGGGCCGCACCC 

* ** * * * * * 

ACTQTACGGCTTC66CG6CTAABTGAA6CAGAC — TTGGCTA 780 
TCT— ACGGCTTCGGAGGCTA 6AC6CCBCCGCTT — CAA 584 



y. = 84.503 

P< 342» 289) = ,000E+00 E = «000 
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Restriction endonuclease cleavage sites were 
determined from the two sequences. A list of cleavage 
sites for the sea urchin and mouse sequences is shown 
below. Numbers indicate the 5" side of the cleavage 
5^ site/ unless site name is in brackets, V7hich indicates 
that only the recognition site is known. 

SEA DRCHIN 

Seouence Afpeers at position 



- AchI 


(6PCBQC)- 


495 




Alul 


<AGCT) 


147 


267 


Asul 


(66NCC} 


. 277 


514 


: Avail 


(GGLCC) 


514 




CsuII 


(CCMBB) 


276 


377 


Ddel 


(CTNAG) 


396 


427 


Hpnl 


(GATC) 


326 ■ 




EcoRI* 


(PPATQQ) 


184 




EcoRII 


(CCLGG) 


531 




. Fnu4HI 


<GCNGC) 


254 




• FnuDII 


(CGCG) 


125 


253 


Fokl 


(GGAT6) 


148 




'Fokl 


(CATCC) 


324 


.515 


-Haell 


(PBCGCQ) 


498 




Haelll 


(GGCC) 


256 


279 


Hsal 


(GCGTC) 


491 




HaiCI 


(GOQPCC) 


494 
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H^iJII <GPGCQC> 
Hhal (GCGC) 
Hindi II <AA6CTT) 
Hinfl (GANTC) 
5. Hj^all (CC6G) 

HphI (SGTGA) 
Hphl ' (TCACC) 
Mbol (6ATC) 
10. Mnll (CCTC) 
Mnll (6A6G> 

NsrI (GGCGCC) 
NspBII (GCMGC) 
IS.PvuI (CGATCG) 
ScrFI (CCJ^GG) 
SfsNI (GCATC) 
Taol (TC6A) 



483 

125 253 295 
145 

200 431 561 

275 376 

368 

195 365 532 
324 

267 342 

4 • 42 54 
372 463 484 

495 
256 
327 

276 377 533 
409 527 

117 327 
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MOUSE 



Seouence 


Appears 


at position 


Acyl (GPC6QC) 


302 


u71 




Aflll (CTTAAG) 


731 






iHluI — <AGCT)- 


i"234 •■ 




0*^0 815 


Asul <Gi3NCC) 


184 


u40 


oil 622 


Avail (BGLCC) 


164 




• 


BssHII (GCGCGC) 


1 AO 


• 




CbuII <CCMGB> 


1 






Ddel (CTNAG) 


w VO 


840 


■ 


•Iipnl <GATC) 


1 on 

X TV 




- 


EEcoBIl (AGCANNNNNNNNTCA) 








CEcoPlD <AGACC) 


418 


AO A 




CEcoPlD ^GGTCT) 


285 


771 




CEcoP153 (CAGCAG) 


765 


< 




EcoRI CGAATTO 


2 






Ec-oRI* (PPATQQ) . 


4 






EcoRI I <CCL6G) 


338 


wOO 


■443 53o 


Fnu4HI .-(GCNGC) 


366 


543 




FnuDII (CGCG) 


162 


164 


380 461 


Fbkl (GGATG) 




905 


910 


Fokl <CATCC> 


111 


322 


346 442 


Haell rPGCGCQ) 


• 305 


312 


537 


Haelll (6GCC) 


404 


542 


612 ^24 


Hdal (GiiCGC) 


472 


579 





682 



587 711 748 
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H^iAI (GLGCLC) 485 
H^iCI (BGOPCC) 21 301 



5. 



HSiJII (GPGCQC) 131 

Hhal (6CGC> 164 166 304 311 380 39 

■ 494 536 

Hinfl (CANTO 692 ' 

HpsII (CCGG) 78 135 401 

HphI (TCACC) 220 339 462 495 

Hbol (GATC) 188 

MboII (6AAGA) - . 105 . 124 

MboII (TCTTC) 629 

Mnll <CCTC) ■ 202 227 236 415 559 

Mnll (GAGG) : . 3 76 261 407 555 

Narl C6GC6CC) 302 

NSF-BII (GCH6C) 368 545 576 579 

PvuII (CA6CTG) 234 

Rsal <6TAC) . 523 668 

SacII (CC6CGB) 683 

20. ScrFI (CCNGG) 135 340 370 445 538 

SfaNI (GATGC) 127 

SfaNI (GCATC) 385 751 

Taal (TCGA) 40 

■ Tthllll (6ACMNNBTC) 466 

25. Tthlllll (T6QTTG) 953 . . 

Xmnl (GAANNNMTTC) 439 
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The sea urchin and mouse sequences are compared with 
Hha I (GCGC) and the described probe sequence • The sea 
urchin sequence has cleavage sites at positions 295 and 
497, thus creating a 202 bp fragment, which, if 
5. denatured/ would hybridize with the probe sequence. Hha 
I (GCGC) sites in the mouse sequence, 166, 304, 311 and 
380, indicate that fragments of ^69^ and 138 could be 
detected with the probe sequence • 

Thus the genetic characterization for sea urchin is 
10. 202 
while that for mouse is 

69 
138 

15. Example 2 

Use of trp D Gene of the Tryptophan Operon as a Probe 

The same type of computer simulation as in Example 1 
was carried out using trp D gene as a probe. This allows 
20. the conclusion that E. cold and Salmonella typhimurium 
can be distinguished by restriction fragments containing 
a conserved sequence. 

The E, coli trp D gene with 684 bp's is shown below: 



^c>- 0120658 

10 20 30 40 "50 

GAAGCC6ACG AAACCC6TAA CAAAGCTCGC GCCGTACTGC 6C6CTATT6C 
CTTC6BCTBC TTT666CATT GTTTCGAGCG CBGCATGACG CGCGATAACG 

60 70 -80 90 100 

CACCGCGCAT CAT6CACAGG AGACTTTCTG AT6GCTGACA TTCTGCTGCT 
GTG6CGCGTA GTAC6TGTCC TCTGAAAGAC TACC6ACTGT AA6ACGACGA 

110 120 130 140 150 

5. CGATAATATC GACTCTTTTA CiBTACAACCT B6CAGATCA6 TTGC6CAGCA 
GCTATTATA6 CTGABAAAAT 6CATGTTGGA CCGTCTAGTC AACGCGTCBT 

160 170 180 190 200 

AT666CATAA C6T6GT6ATT TACCGCAACC ATATACCBGC GCAAACCTTA " 
TACCC6TATT GCACCACTAA ATG6CGTTGG TATATGGCC6 CBTTT6GAAT - 

210 220 230 240 250 

ATTGAACGCT TBGC6ACCAT GAGTAATCC6 6TGCTGAT6C TTTCTCCT6G 
TAACTTBCGA ACCBCTGGTA CTCATTAGGC CACGACTAC6 AAABAGGACC 

10. 

260 270 280 290 300 

CCCCGGT6TB .CC6AGC6AAG CCGGTTGTAT GCC66AACTC CTCACCCGCT 
GGGGCCACAC 6BCTCGCTTC GGCCAACATA CGGCCTTBAG GAGTGGGCGA 

310 ■ 320 330 340 350 

T6CGTGBCAA GCTGCCCATT ATTGBCATTT GCCTCGGACA TCABGCGATT 
.AC6CACCBTT CBACBBGTAA TAACCGTAAA CGGAGCCTGT AGTCCGCTAA 

360 370 380 390 400 

15.GTCGAAGCTT ACGGGGGCTA T6TCGGTCAG 6CGGGCGAAA TTCTCCACGG : 
CAGCTTCGAA TGCCCCCGAT ACAGCCAGTC CGCCCGCTTT AAGAGGTGCC 

410 420 ■ 430, 440 450 ■ 

TAAAGCCTCC AGCATTGAAC ATGACGGTCa' GGCGATGTTT GCCGGATTAA 
AT.TTCGGAGG TCGTAACTTG TACTGCCAGT CCGCTACAAA CGGCCTAATT 

460 • 470 ■ 480 490 500 

CAAACCCGCT GCCGGTGGCG CGTTATCACT CGCTGBTTGG CAGTAACATT- 
GTTT66BCGA CGGCCACCGC GCAATAGTGA 6CGACCAACC GTCATTGTAA - 

20- 510 520 530 540 550 ■ 

CCG6CCGGTT TAACCATCAA CGCCCATTTT AATGGCATG6 TGATGGCAGT 
GGCCGGCCAA ATTGGTAGTT GCBGGTAAAA TTACCGTACC ACTACCGTCA 



560 570 580 590 600 

ACGTCACGAT GCGGATC6CG TTTGTGGATT CCA6TTCCAT CCGGAATCCA 
TGCAGTGCTA CGCCTAGCGC AAACACCTAA 6GTCAAGGTA GGCCTTAGGT 

610 620 630 640 650 

25TTCTCACCAC CCAG6GCGCT CGCCTGCTGG AACAAACGCT GGCCTGGGCB 
■*AA6AGTGGTG 6GTCCCGCGA GCGBACGACC TTGTTTGCGA CCGGACCCGC 

660 670 680. 

CAGCATAAAC TAGAGCCAGC CAACACGCTG CAA 
6TCGTATTTG ATCTCGGTCG GTTGTGCGAC GTT 
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The trp D gene, 683 base pairs, for S, typhimurium 
is shown below: 



10 '20 30 40 50 

5. GAAGCCGATG AAACCCGTAA TAAAGCGCGC GCCGTATTGC GTGCTATCGC 
CTTCG6CTAC TTTGG6CATT. ATTTCGC6CG CGGCATAAC6 CAC6ATAGC6 

60 70 ■ 80 90 100 . 

CACCGCGCAT CATGCACAGG AGACCTTCTB .AT6BCTGATA TTCT6CTBCT 
GT6GCGCGTA OTACGTGTCC TCTBBAABAC TACCBACTAT AAGACBACBA 

110 120 ' 130 140 150 

CGATAACATC BACTC6TTCA CTTG6AACCT G6CAGATCAG CTACBAACCA 
6CTATTGTAG CTGAGCAABT GAACCTTGGA CCGTCTABTC GATBCTTGGT 

10. 

160 170 18.0 190 200 

AC6GTCATAA CGTGGTGATT TACCGTAACC ATATTCCGGC GCAGACGCTT ' 
TGCCAGTATT GCACCAC7AA ATG6CATTGG TATAAGGCCG CBTCTGCGAA 

210 220 230 240 250 

ATCGATCGCC TG6CGACAAT GAAAAATCCT GTGCTAATGC TCTCCCCCGG 
TAGCTAGCGG ACCGCTGTTA CTTTTTAGGA CAC6ATTACG AGAGGBGGCC 

260 270 280 290 300 

15.CCC6G6TGTT CCCAGCGAGG CAGGTTBTAT GCCBGAGCTB CTGACCCBAC 
GGGCCCACffA GGGTCGCTCC GTCCAACATA CGGCCTCGAC BACTGGGCTG 

310 320 " 330 340 350 

• ' TAC6CBGCAA GTTACCGATC ATCGGCATTt GTCTBGG6CA TCAGGCGATT 
ATGC6CCGTT CAAT6GCTA6 TAGCC6TAAA CA6ACCCCGT ABTCCGCTAA ^ 

360 370 380 390 . 400 

BTCGAAGCTT ACGGCGGTTA CGTEGGTCAG GCGG6AGAAA TCCTGCAT6G 
. CAGCTTCGAA T6CCBCCAAT GCA6CCAGTC CBCCCTCTTT A6GACGTACC 

20. 

410 420 430 440 450 

CAAAGCCTCC AGCATTGAGC ATGACGBTCA 6GCGAT6TTC GCCBGGCTCB 

GTTTCGGAGB TC6TAACTC6 TACTGCCA6T CCGCTACAAG CGGCCCGAGC 
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460 470 480 . 490 500 

CBAATCCGCT ACCG6TCGCG C6TTATCATT CGCTG6TCG6 CAGTAAT6TT 
BCTTAGGCGA TGBCCAGCGC 6CAATA6TAA GC6ACCAGCC BTCATTACAA 

510 520 530 540 550 

-CCTBCCGGGC— T-GACE-AT-T-AA-e6CCCAT-T-TC-AAC8GCATGB-TGAT6BCGeT 
GBACGGCCCC ACT6BTAATT GCGGGTAAAG TTBCC6TACC ACTACCGCCA 



560 570 580 590 600 

ACGTCATGAT GC6GATC6CG TTTGCGGTTT TCAATTTCAT CCCGAGTCCA 
T6CAGTACTA CGCCTA6CGC AAACGCCAAA AGTTAAAGTA GBGCTCAGGT 

610 620 630 " 640 650 

TCCTGACGAC ACA6BGCGCG CGTCTACTG6 A6CAAACATT ABCDTB6BC6 
AGGACTGCTG T6TCCCGCGC BCAGAT6ACC TCGTTTGTAA TCGGACCC6C 

660 670 680 

10. CTGGCGAAGC TGGAACCGAC CAACACCCTA CAG 
GACC6CTTC6 ACCTT6GCTG 6TTGTGGGAT 6TC 
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Two homology regions between both sequences were 
next established, where the upper sequence if for E. coli 
and the lower one for S.tvphiinuriun>i 

Region I 



^ ^ ^ ' ^ 4 ^ ^ ^4 

•!* i^^N 

452 AAACCCGCTGCC66T6GCGC6TTATCACTCGCTGGTT6GCA6TAACATTCC-G6CCGB-T 

453 AA-TCCGCTACCB6TCGCGCBTTATCATTCGCTGGTCG6CAGTAATGTTCCTGCC-BBBC 

TTAACCATCAAC6CCCATTTTAATGBCATGGTGATGGCA6TACGTCACGAT6CGGATCBC 
TGA-CCATTAAC6CCCATTTCAACBGCATBGTGATGBCBBTACBTCATBATGCBGATCBC 

^ ^ ^ ^ *i* ^ ^^^^ ^* 4f 4 *' 

/GTTTGTGG-AT-TGCAGTTECATCCSBAA-TCCA-T-Te-TGACCACGCAGGGGBGT-GBEC-TGCT 
GTTT6CBBTTTTC-AATTTCATCCCBAGTCCATCCTBACBACACA66GCGCBCBTCTACT 

■Sti ^1:^ t * * * in * % ^ t 

G6AACAAACGCTGGCCTGGGCGC-AGCATAAACTAGAGCC-AGCCAACACGCTGCA 682 
BBAGCAAACATTABCCTGGBCGCTBBC-BAABCTBGAaCCGACC-AACACCCTACA 682 



% = 78.390 

P( 236 » 185) = .OOOE+00 



4 000 
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Region II 



1 6AAGCC6ACGAAACCCGTAA-CAA-A6CTCGCGCCGTACTGCGCGCTATTGCCACCBC6C 
1 GAAGCCGATGAAACECGTAATAAABCGCGCeC~CBTATTGCGT6CTATC6CCACCECGC 

' * * « * * 

ATCAT6CACA66AGAC-TTTCT6ATGGCT6ACATTCTGCTGCTCGATAATATCGACTC-T 
ATCATGCACA6GAGACCTT-CT6AT66CTGATATTCTGCTGCTCGATAACATCGACTCGT 

******** ******** 
TTTAC—GTACAACCTGGCAGATCAGTTGCBCAGC-AATBGG-CATAACGTGGTGATTTA 

T-CACTT6GA-ACC-T66CAGATCA6CTACG-AACCAA-CG6TCATAACGTGGTGATTTA 

* * ** ****** * *«* 

CCBCAACCATATACC65C6CAAAC-CTTAATTGAACGC-TTB6C6ACCATGAGTAA-TCC 
CCGTAACCATATTCCGGCGCAGAC6CTTA-TCGATCGCCTG6-CGACAATGA-AAAATCC 

* * * * * * ********** 
GGTGCTGATGCT-TTCTCCTGGCCCCGG-TGT-GCC-GAGCGAAGCCGBTTGTATGCCGG 
TGT6CTAATGCTCTCC-CCC6GCCC-6GGTBTTCCCAGCG — AG6CA6GTTBTATGCCGG 

*** *** **■ ***** ****. 

AACTCCTCACCCG-CT-T6CBTB6CAAGCT6CCCATTATTGGCATTTGCCT-CGGACATC 
AGCTGCTGACCC6ACTACGCG — BCAAGTTACCGATCATCG6CATTT6TCTGGGG-CATC 

**** *■ **** 

AGGCGATT6TC6AAGCTTACGG-GGGCTATBTCGGTCAG6CG6GCGAAATTCTCCACG6- 
■ AGGCGATTGTCGAAGCTTACGGCG6-TTACGTCGGTCAGGCGGGABAAATCCT6CATG5C 

* * * * 
TAAAGCCTCCAGCATTGAACATGACBGTCA6GCGATGTTTGCCGG 445 
AAA-GCCTCCA6CATTGAGCATGACGGTCAGGCGATGTTC6CCG6 . 445 

% — ~ SO ♦SIS * * 

P( 465*^373) = «0O0E+00 ' E * .OOO 
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Restriction sites in both sequences are shown as 
follows: 

E, coli 

.an (CCGG> • 187 229 254 272 283 ,,3 502 

Phi (G6TBA) 177 553 

-hi CTCACC) ■ 285 597 " 

:.oI (GATC) 135 5^4 . ^ ' , • 



S . typhiinuriviin 

<CCG6) 187 248- 253 283 443 4^3 506 

hi (GGTBA) 177 552 

ol .(GATC) 135 204 317 564 

II (CCTC) 417 
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The E> coli sequence has Mbo I (GATC) sites at 135 
and 554. There is a 429 bp fragment that can be detected 
by both Region 1 and 2 probes. The same enzyme has sites 
at 135 f 204, 317, and 564 in the S. typhimurium 
sequence. A probe of the two homology regions would 
detect fragments of 69 , 113 and 247 bp. 

Thus the identifying genetic characterization for EL> 
coli with this probe and enzyme is 

429 

whereas that for S. typhimurium is 

69 

113 . 
247 
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Use of c-Fetoprotein Gene as a Probe 

This example shows the use of a region of homology 
5. in the a-f etoprotein gene sequence of human and rat, and 
the endonuclease Mnll (GAGG). 

The human a-f etoprotein message cDNA (1578 bp's) is 
as follows: 



10 20 30 40 50 

10. AGCTTGBCAT AGCTACCATC ACCTTTACCC A6TTTGTTCC 6GAAGCCACC 
. TCGAACCGTA TCGATGBTAG TG6AAAJG6G TCAAACAA6B CCTTCG6T66 

60 70 80 90 100 

GAGGAGGAA6 TGAACAAAAT 6ACTAGCGAT GTGTTGGCTG CAATGAA6AA 
CTCCTCCTTC ACTTGTTTTA CTGATCGCTA CACAACCGAC GTTACTTCTT 

110 120 130 140 150 

AAACTCTGGC 6AT6G6TGTT TAGAAAGCCA GCTATCTGTG TTTCT66ATG 
TTT6A6ACC6 CTACCCACAA ATCTTTC6ST CBATAGACAC AAABACCTAC 

15.- 

160 170 180 190 200 

AAATTTGCCA TGAGACGGAA CTCTCTAACA A6TAT6GACT CTCAGGCT6C 
TTTAAAC6GT ACTCT6CCTT 6AGA6ATTGT TCATACCTGA 6AGTCC6ACG 

210 220 230 240 250 

TGCAGCCAAA GTGGA6TGGA AAGACATCAG T6TCTGCTGG CAC6CAAGAA 
ACGTCGGTTT CACCTCACCT TTCTGTAGTC ACAGACGACC 6TGCGTTCTT 

260 270 280 290 300 

20. GACT6CTCC6 GCCTCTGTCC CACCCTTCCA GTTTCCAGAA CCTGCCGA6A 
CTGACBA6GC CGGA6ACA66 6TG6BAABBT CAAAG6TCTT 6GACGGCTCT 

310 320 330 340 * 350 

GTTGCAAAGC ACATGAA6AA AACAGG6CAG TGTTCAT6AA CAGGTTCATC 
CAACGTTTC6 T6TACTTCTT TTGTCCCGTC ACAAGTACTT GTCCAA6TAG 

360 370 380 390 400 

TATGAAGTGT CAAGGAGGAA CCCCTTCATG TATG€CCCAG CCATTCTGTC 
ATACTTCACA 6TTCCTCCTT GGGGAA6TAC ATACG6GGTC GGTAAGACAG 



25, 



410 420 430 440 450 

CTT6GCTGCT CAGTACGACA AGGTCGTTCT GGCATGCTGC AAAGCTGACA 
6AACCGACGA GTCAT6CTBT TCCAGCAABA CCGTACGACG TTTCGACTBT 
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5. 



460 470 480 490 500 

ACAA6GAGBA GTGCTTCCAG ACAAA6AGAG CATCCATTGC AAA6GAATTA 
TBTTCCTCCT CAC6AAB6TC TGTTTCTCTC GTAG6TAAC6 TTTCCTTAAT 

510 520 530 540 550 

AGAGAAG6AA GCATGTTAAA T6A6CAT6TA T6TTCA6T6A TAA6AAAATT 
TCTCTTCCTT CBTACAATTT ACTCGTACAT ACAA6TCACT • ATTCTTTTAA 

560 570 580 590 60Q 

T66ATCCCGA- AACCTCCAGB CAACAACCAT TATTAAGCTA AB7CAAAAGT 
ACCTAGGGCT TTGGA66TCC 6TTGTT66TA ATAATTCGAT TCAGTTTTCA 

610 620 630 640 650 

'•■ TAACT6AABC AAATTTTACT BA6ATTCABA A6CTBBCCCT 66ATGTBBCT 
ATTBACTTCB TTTAAAATBA CTCTAAGTCT TCGACCB66A CCTACACCBA 

660 670 680 690 700 

TO CACATCCACB AGGA6TGTT6 CCAAG6AAAC TCBCTGGAGT GTCTGCAG6A 
BTBTABBTBC TCCTCACAAC B6TTCCTTTG AGC6ACCTCA CAGACGTCCT 

710 720 730 " 740 • 750 

TG6G6AAAAA GTCATGACAT ATATATGTTC TCAACAAAAT ATTCTGTCAA 
ACCCCTTTTT CAGTACTGTA TATATACAAG AGTTGTTTTA-TAA6ACA6TT 

760 770 780 790 800 

GCAAAATAGC AGAGTGCTGC AAATTACCCA TGATCCAACT A6GCTTCTGC 
CGTTTTATC6 TCTCACGACG TTTAATG6GT ACTA6GTTGA TCCGAA6ACG. 

15v • - • . 

810 820 830 840 850 

ATAATTCACB CABAGAATGG CGTCAAACCT GAAGGCTTAT CTCTAAATCC 
TATTAAGTBC 6TCTCTTACC GCAGTTTGGA CTTCCGAATA GAGATTTAGG 

860 * 870 880 890 900 

AAGCCAGTTT TTGBGAGACA GAAATTTTGC CCAATTTTCT TCAGAGGAAA 
TTCGGTCAAA AACCCTCTGT CTTTAAAACG GGTTAAAAGA AGTCTCCTTT 

910 920 930 940 950 

. 20.aaatcatgtt catg6caagc tttcttcatg aatactcaag aactcacccc 
'tttagtacaa gtaccgttcg aaagaagtac ttatgabttc ttgabtgggg 

960 970 980 , * 990 1000 

AACCTTCCTB TCTCAGTCAT TCTAA6AATT BCTAAAACGT ACCAGBAAAT 
TTG6AAGBAC A6AGTCAGTA AGATTCTTAA C6ATTTTGCA TBGTCCTTTA 

1010 1020 1030 1040 105O 

ATTGGAGAAG TGTTCCCAGT CTGGAAATcf ACCTGGATGT CA66ACAATC 
TAACCTCTTC ACAAGBGTCA GACCTTTAGA TGGACCTACA GTCCTGTTAG 
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1060 1070 1080 1090 1100 

TG6AA6AA6A ATT6CA6AAA CACATC6AGG AGAGCCAGGC ACTGTCCAAG 
ACCTTCTTCT TAACGTCTTT GTGTA6CTCC TCTCGGTCCB T6ACAB6TTC 

1110 1120 1130 1140 1150 

, CAAAGCTBCG CTCTCTACCA GACCTTA6GA GACTACAAAT TACAAAATCT 
6TTTCGACGC GA6AGAT6GT CTGGAATCCT CT6ATGTTTA ATGTTTTAGA 

5. IIAO 1170 1180 1190 1200 

GTTCCTTATt GGTTACACGA 6GAAA6CCCC TCA6CTGACC TCAGCA6AGC 
CAAGGAATAA CCAATBTGCT CCTTTCGGGG AGTCGACTGG AGTCGTCTCG 

1210 1220 1230 1240 1250 

TGATCGACCT CACCGGGAAG AT6GT6AGCA TTGCCTCCAC GTGCTGCCAG 
ACTA6CTBGA 6TG6CCCTTC TACCACTCGT AAC66A6GTG CACGACGGTC 

1260 1270 1280 1290 1300 

CTCAGC6AGG AGAAATG6TC CGGCTGTGGT GA6GGAATGG CCBACATTTT 
10.' GAGTC6CTCC TCTTTACCAG 6CCGACACCA CTCCCTTACC 6GCTGTAAAA 

1310 . 1320 1330 1340 1350 

CATTGGACAT TTBTGTATAA GGAAT6AA6C AA6CCCTGTG AACTCT6GTA 
6TAACCTGTA AACACATATT CCTTACTTCG TTCGGGACAC TTGA6ACCAT 

1360 1370 ■ 1380 1390 1400 

TCAGCCACTG CTGCAACTCT TC6TATTCCA ACAGGAGGCT AT6CATCACC 
AGTCGGT6AC GACGTT6A6A AGCATAAG6T T6TCCTCCGA TACGTAGT6B 

15. 1410 1420 1430 1440 ~ 1450 

AGTTTTCTGA 66GAT6AAAC CTATBCCCCT CCCCCATTCT CTBAGGATAA 
TCAAAAGACT CCCTACTTTG 6ATAC6G6GA GGGG6TAA6A GACTCCTATT 



1460 1470 1480 1490 1500 

ATTCATCTTC CACAAGGATC GT6CCAAGCT C66CAAA6CC CTACA6ACCA 

TAAGTAGAAG 6TGTTCCTAG CACGGTTCGA GCCGTTTCGG BAT6TCTGBT 

1510 1520 1530 • 1540 1550 

20. TGAAACAAGA 6CTTCTCATT AACCT6GTGA AGCAAAAGCC TBAACTGACA 

ACTTTGTTCT C6AAGAGTAA TTG6ACCACT TCGTTTJCGG ACTTGACTGT 

1560 1570 
GAG6AGCA6C TGGCGGCT6T CACTGCA6 
CTCCTCGTCG ACCGCCGACA GTGACGTC^ 
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The rat o-f etoprotein 3' end cDNA is as follows (54 0 
bp's): 



10 20 30 40 50 

5. GA6GGACTGG CCGACATTTA CATTGGACAC TTBTGTTTAA GACAT6A66C 
CTCCCTBACC GGCTGTAAAT 6TAACCTGTG AACACAAATT CTGTACTCCS 

60 70 80 90 100 

AAACCCTGTG AACTCCGGTA TCAACCACTG CTGCA6TTCC TCGTATTCCA 
TTTGGGACAC TTGAGGCCAT AGTTGBTGAC 6AC6TCAAGG AGCATAAG6T 

110 120 130 140 ' 150 

ACAG6A6GCT CTGCATCACC A6CTTTCTGA G6GACGAAAC CTAC6TCCCT 
TGTCCTCCGA 6AC6TAGTGG TCGAAA6ACT CCCT6CTTT6 GAT6CAGGBA 

10. 

160 170 180 190 200 

CTACCATTCT CTGCGACAAA TTCATCTTCC ACAAG6AATC TGTGCCAAGC 
GATGGTAAGA GACGCTGTTT AAGTAGAAGG TGTTCCTTAG ACACGGTTCG 

210 220 230 - . 240' 250 * 

TCAGGGCCGA GCACCACA6A CCAT6AAACA AGAGCTTCTC ATTAACCTAG 
A6TCCCGGCT C6TG6TGTCT 66TACTTTGT TCTCGAA6A6 TAATT6GATC 

260 270 280 290 300 

15. TGAAACAAAA GCCTGAAATG ACA(3AGGAGC AGCACGCGGC TGTCACTGCT 
ACTTTGTTTT C66ACTTTAC TGTCTCCTCG TCGTGCGCCG ACAGTGACGA 

310 320 330 340 350 

GATTTCTCTG GCCTCTTGGA GAAGTGCTGC AAAGACCAGG ATCAGGAAGC 
CTAAAGAGAC CGGAGAACCT CTTCACGACG TTTCT6GTCC TA6TCCTTCG 

360 370 380 390 400 

CTGTTTCGCA AAAGA6GTCC AAGTTGATTT CCAAACTC6T GAGGCTTTGB 
GACAAAGCGT TTTCTCCAGG TTCAACTAAA 6GTTTGAGCA CTCCGAAACC 

20. 

410 420 430 440 450 

GGGTTTAAAC ATCTCCAAGA GGAAGAAAG6 ACAAAAAAAT GTGTCGACGC 
CCCAAATTTG TAGAGGTTCT CCTTCTTTCC TGTTTTTTTA CACAGCT6CG 

460 470 480 490 500 

TTTGGTGTGA GCTTTTCGGT TTGATG6TAA CT6GTGGAGA CTTCCATGT6 
AAACCACACT CGAAAAGCCA AACTACCATT 6ACCACCTCT GAAGGTACAC- 

510 520 530 540 

25. GGATTTCTAT GCCTAAGGAA TAAAGACTtT TCAACTGTTA 
CCTAAAGATA CGGATTCCTT ATTTCT6AAA AGTTGACAAT 
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The homologous regions between both is as -follows/ " 
(human: upper; rat: lower): 



*** * * * * * • t 

1367 CTCTTCGtATTCCAACA6GAGGCTATGCATCACCAG-TTTTCTBAGGGATGAAACCTATE 
90 CTC 6TATTCCAACA6SAGGCTCTGCATCACCAGCTTT-CTGAGGGACGAAACCTAC6 

t 

% %K. % %% " . % % 

CCCCTC-CCCCATTCTCTGA6GA-TAAATTCATCTTCCACAA6GA-TC-GTGCCAA6CTC 
TCCCTCTACC-ATTCTCTG-CGACAAA-TTCATCTTCCACAAGGAATCTGTGCCAAGCTC 

%%%%%%% - * ■* 

-GG — CAAAGC-CCTACAGACCATGAAACAAGAGCTTCTCATTAACCTGGTGAAGCAAAA 

AGGGCC6A-6CACC-ACAGACCATGAAACAAGAGCTTCTCATTAACCTAGTGAAACAAAA 

* ' ** 

GCCTGAACTGACA6AGGAGCAGCTGGCGGCTGTCACTGC vl576 
6CCTGAAATGACAGAGGAGCAGCACGCGGCTGTCACT6C '299 

% = 85.8^5 

P( 219r 1S8) = .OOOE+00 E = .000 
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Restriction sites for both human and rat are as 
follows : 

HUMAN 



MboII "(GAAGA) 
ttboll (TCTTC) 

Mrill (CCTC) 

Hnli CGAGG) 



108 261 328 1066. 1069 1230 

881 916 1361 1449 

273 574 1190 1200 1219 1243 1439 

44 47 358 449 653 887 1070 

1250 1274 1378 1402 1436 1544 



RAT 

MboII (GAAGA) - 435 

MboII CTCTTC) 168 
Mnll .(CCTC) : 100 159 " 323 

Mnll (GAGG) 39 98 122 267 357 384 Ml 
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It can be calculated that fragments containing a 
portion of the conserved sequence, ^2£, _34^, 104 and 108 bp 
describe the human DNA. Fragments of 24r 59# 9[0_ and 145 
describe the rat DNA. While both sequences contain the 
J4_ bp fragment, it is the set of fragments (taxonomic 
characters) that is of significance. 
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CIAIMS 

1. A method of characterizing an unknown organism 
species which comprises: 

determining the position of part or whole of evoLu- 
tionarily conserved sequences in genetic material of said 
5- organism relative to a known position in said genetic 

material (other than by determining the chromatographic 
pattern of restriction endonuclease digested DNA from said 
known organism, which digested DNA has been hybridized or 
reassociated with ribosomal ENA information-containing 
10 • nucleic acid from or derived from a known probe organism) , • 
thereby to obtain an identifying genetic characterization 
of said unknown organism, and 

comparing said characterization with information 
from at least two sets of identifying genetic character- 
is. izations derived from the same conserved sequences, each 
of said sets defining a. known organism species. 

2. A method as claimed in Claim 1, wherein the 
known position is defined by one or more restriction endo- 
nuclease cleavage sites. 
20* 3. A method as claimed in Claim 1, wherein the 

genetic material is DNA. 

4. A method as claimed in Claim I, which ccmprises 
comparing the chromatographic pattern of restriction endo- 
nuclease digested DNA from said unknown organism, which 
digested DNA has been hybridized or reassociated with 
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conserved DNA sequence information-containing nucleic acid 
from or derived from a probe organism or frcHD a consensus 
sequence, with equivalent chromatographic patterns of at 
least two known different organism species. 

5« 5. A method as claimed in Claim A, wherein said 

conserved DNA information-containing nucleic acid is 
detectably labelled. * ^ 

6. A method as claimed in Claim 5, wherein said 
conserved DNA infomation-containing nucleic acid is 

10. radiolabelled or metal labelled. 

7* A method as claimed in Claim 4,5 or 6, wherein 
said conserved DNA information-containing nucleic acid 
probe is an RNA probe. 

8. A method as claimed in Claim 4, .5 or 6, wherein 
15. said conserved DNA information-containing -nucleic acid 

probe is DNA complementary to RNA. 

9. A method as claimed in Claim 4,5 or 6, wherein 
said conserved DNA information- containing nucleic acid 
probe is DNA obtained by nick-translating or cloning DNA 

20. complementary to RNA. 

10. A method as claimed in any one of Claims 1 to 4, 
wherein said unknown organism being characterized is a 
cell or cells of .a strain in an in vitro culture. 

11. A method as claimed in any one of claims 1 to 4, 
25. wherein said unknown organism being characterized and said 

probe organism are both from the same kingdom, subkingdom. 
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division, subdivision, phyliaiu, subphylum/ class, sub- 
class, order, family, tribe or genus. 

12. A method as claimed in Claim 4, wherein said 
unknown organism being characterized and said probe 
organism are both eukaryotic- - 

13. A method as claimed In Claim 4, wherein said 
ujiknown organism being characterized and said probe 
organism are both prokaryotic. 

14. A method as claimed in Claim 4, wherein said 
unknown organism being characterized is eukaaryotic and 
said probe organism is prokaryotic* 

15. A method as claimed in Claim 12 or 14, which 
further comprises detecting for the presence of a nucleic 
acid sequence or sequences crisating a taxon below the rank 
of species or an infrasubspecific s\ibdivision- 

16. A method as claimed in Claim 4, wherein said 
unknown organism being characterized is prokaryotic and 
said probe organism is eukaryotic. 

17. A method as claimed in Claim 23 or 26, wherein 
said prokciryotic organism being characterized is select- 
ively being detected while in the presence of a eukaryotic 
organism. 

18. A method as claimed in Claim 17, \rtierein said 
prokaryotic organism is a bacterium. 

19. A method as claijned in Claim 22, wherein the DNA 
from said etakaryotic organism being characterized is 
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nuclear DNA, and the conserved DNA information-containing 
nucleic acid from said eukaryotic probe organism is not 
derived from mitochondria or chloroplasts. 

20. A method as claimed in Claim 12, wherein the DNA 
5. from said eukaryotic organism being characterized is mito- 
chondrial DNA and the conserved DNA information-containing 
nucleic acid from said eukaryotic probe organism is derived 
from mitochondria or chloroplasts. 

21. A method as claimed in Claim 12, wherein DNA 

10. from said eukaryotic organism being characterized is chlo- 
roplast DNA and. the conserved DNA information-containing 
nucleic acid from said eukaryotic probe organism is derived 
from mitochondria' or chloroplasts. 

22. A method as claimed in Claim 14, wherein said 
15. DNA from said eukaryotic organism being characterized is 

derived from mitochondrial DNA. 

23. A method as claimed in .Claim 14, wherein said 
DNA from said eukaryotic organism being characterized is 
derived from chloroplast DNA. 

20- 24. A- method as claimed in Claim 16, wherein said 

conserved DNA information-containing nucleic acid probe. 

is derived from mitochondria or from chloroplasts. 

25. A method as claimed in Claim 12 or 14, which 

further comprises identifying in said unknown eukaryotic 
25. organism being characterized a virus, or a virus-derived 

DNA creating a taxon below the rank of species. 
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26. A method of identifying a bacterium of an un- 
knovm bacterial strain present in a sample which comprises: 

determining the position of part or whole of evolu- 
tionarily conserved sequences in DNA of said bacterium, 

5^ relative to a known position in said DNA (other than by 
determining the chromatographic patfern of restriction 
endonuclease digested DNA from said unknown organism, 
which digested DNA has been hybridized or reassociated 
with ribosomal RNA informations-containing nucleic acid 

10. from or derived from a known probe organism) , thereby to 
obtain an identifying genetic characterization of said 
unknown bacterium, and 

comparing said characterization with information 
from at least two sets of identifying genetic character- 

15. izations derived from the same conserved sequences, each 
of said sets defining a known bacterial species. 

27. A method as claimed in Claim 26, wherein the 
known position is defined by one or more restriction endo- 
nuclease cleavage sites. 

20. 28. A method as claimed in Claim 26, whicb comprises 

comparing the chromatographic pattern of restriction-endo- 
nuclease digested DNA from said unknown bacterium, which 
digested DNA has been hybridized or reassociated with, 
conserved .DNA sequence information-containing nucleic acid 

25. from or derived from a probe bacterium or from a consensus 
sequence, with eqiiivalent chromatographic patterns of 
Jcnown bacteria. 
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29. A method as claimed in Claim 26, wherein said 
unknown bacterium is present in a fermentation medium or 
in a secretion or excretion product. 

^ 30. A method as claimed in Claim 26, wherein said 
unknown bacterium is present in or associated with etakary- 
otic tissue. 

31. A method as claimed in Claim 30, wherein said . 
bacterium is present in or associated with animal or plant 
cells. 

32. A method as claimed in Claim 30, wherein said 
bacterium is present in or associated with hiaman cells, .or 
associated with plant root cells. 

33. A method as claimed in any one of Claims 28 to 
32, wherein said conserved DNA information-containing : 
nucleic acid from said probe bacterium is detectably - 
labelled. 

34. A method as claimed in Claim 33, wherein said: 
label is a radiolabel or a metal label. 

35. A method as claijaed in Claim 33, wherein said 
nucleic acid from said probe bacterium is RNA. 

36. A method as claimed in Claim 33 ^ wherein said, 
nucleic acid from said probe bacterium is complementary* 
DNA to RNA. 

37. A method as claimed in any one of Claims 28 to 
32, wherein said unknown bacterium is pathogenic towards 
plants or animals. 
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38 • A method as claimed in any one of Claims . 2 8 to 
32, which further comprises detecting for the presence of 
a nucleic acid sequence or sequences creating a taxon 
below the rank of species, or an infrasubspecific sub- 
s' division. 

39. A method as claimed in Claim 38, wherein said 
nucleic .acid sequence or sequences are all or part of a 
bacteriophage genome. 

40. A method as claimed in Claim 38, wherein said 
10. nucleic acid sequence or sequences are all or part of an 

extrachromosomal genetic element, a plasmid, or an episome. 

41. A method as claimed in- Claim 3 8,. wherein said 
sequence or sequences code for an E-f actor or for an anti- 
biotic resistance factor. 

15. 42. A. method as claimed in Claim 28, wherein said 

chromatographic patterns of known bacteria are present in 
a catalog containing patterns for at least two different 
bacteria. 

43. A kit comprising a carrier being compartment- 
20. alized to receive in close confinement therein one or more 
container means, wherein a first container means contains 
conserved genetic material sequence information-containing 
nucleic, acid (otiier than ribosomal RNA information- 
containing nucleic acid) from or derived from a probe 
25. organism or from a consensus sequence; and wherein said 
kit also contains a catalog having hybridized or _ 
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reassociated chromatographic band patterns for at least 
two known different organism species, or said kit contains 
at least two known organism species, or genetic material 
derived therefrom. 

5. 44, A kit as claimed in Claim 43, wherein the gene- 

tic material is DNA. 

'45. A kit as claimed in Claim 43 or 44, which also 
comprises a second container means containing one or "more 
restriction endonuclease enzymes. 

10.' 46. A kit as claimed in Claim 44, wherein the con- 

served DNA sequence information-containing nucleic acid 
probe is detectably labelled. 

47. A kit comprising a carrier being compartmental- 
ized to receive in close confinement therein one or more 

15. container means, wherein a first container means contains 
conserved DNA sequence information-containing nucleic acid 
(othe^r than ribosomal RNA information-containing nucleic 
acid) from or derived from a probe organism or from a 
consensus sequence, said nucleic acid being in detectably 

20. labelled form; and 

wherein said kit also comprises a second container 
means containing one or more restriction endonuclease 
enzymes. 

48. A kit as. claimed in Claim 43, 45 or 47, wherein 
25. said .conserved DNA sequence information-containing nucleic 

acid probe is KNA. 



-112- 



0120658 



49. A kit as claimed in Claim 43, 45 or 47, wherein 
said conseirved DNA secpience information-containing nucleic 
acid probe is DNA complementary to ENA. 

50. A kit as claimed in Claim 48, which also com- 

5, prises a container means containing one or more detectably 
labelled deoxynucleoside triphosphates. 

51. A kit as claimed in Claim 43, 45 or 47, wherein 
said probe organism is a prokaryote. 

52. A kit as claimed in Claim 43, 45 or 47, wherein 
10. said probe organism is a eukaryote. 

53. A kit as claimed in Claim' 51, wherein said 
prokaryote is a bacteriim. 

54. A kit as claimed in Claim 52, wherein said 
nucleic acid probe of said eukaryote is derived from an 

15. organelle thereof. 

55. A kit as claimed in Claim 49, \iriaerein said cDNA 
is labelled with '^^?,^^e or ^H. 

56. A kit as claimed in Claim 49, wherein said cDNA 
is a faithful copy of the SNA it is derived from. 

20. 57. A kit as claimed in Claim 43, 45 or 47 ^ which 

also comprises one or more container means containing viral 
nucleic acid probes. 

58. A kit as claimed in Claim 43, 45 or 47 wherein 
said catalog is a book, a computer tape, a computer disk, 

25. or a computer memoir- 

59. A kit as claimed in Claim 43, 45 or 47, wherein 
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said catalogue also includes viral chromatographic 
patterns. 

60. A kit as claimed in Claim 43, 45 or 4 7 wherein 
said catalogue also includes patterns for infrasubspecif ic 
taxons . 
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EcoR I digested DNA 
from P. aeruginosa strains 



FIG. 1 
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PST I digested DNA 
from P. aeruginosa strains 



FIG. 2 
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FIG. 3 EcoR I digested DNA from Pseudomonas 
and Acinetobacter species 
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FIG. 4 PST I diqested DNA from Ps eudomonas 
ana Acinetobacter species 
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EcoR I digested ONA from Bacillus subtil Is strains 



FIG. 5 
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PST I digested DNA from Baciilus subtilis strains 
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Sac I digested DNA from Bacillus subtilis strains 



FIG. 8 
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EcoR I digested DNA from i. subtilis 
RH 3021 and B. polymvxa RH 3074, 
RH 3033, RH 3062, RH 3073 



FIG. 9 







CO 


cvj 


P3 


CM 




CO 


CD 




O 


o 


o 


O 


o 


CO 


? CO 


CO 


CO 


CO 


X 


. X 


X 


X 


X 




oc 


X 




a: 




0120658 

10/16 

PST I digested DNA from subtiiis 
RH 3021 and B. polymyxa RH 3074, 
RH 3033, RH 3062, RH 3073 
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n I digested DNA from 

B, subtihs RH 3021 and B. polymyxa RH 3074 
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'2/16 /zg RH 3077 DNA with 

Organs from RH 3077 m uee DNA to total 
Infected mouse 10 /ug per well 
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FIG. 3 EcoR I digested DMA from PseudomonaQ 
and Acjnetobacter species 
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EcoR 1 digested DNA from Bacillus sublifjs strains 
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EcoR I digested DNA from B, subtllis 
RH 3021 and B. polvmvxa RH 3074, 
RH 3033, m 3062, RH 3073 



Fit 9 







CO 


cv 














O 


O 


o 


o 


o 


















r 


X 


o: 


q: 


or 


















0120658 

PST I digested DNA from L subtitis 
RH 3021 and B. polymyxa RH 3074, 
RH 3033, RH3062, RH3073 
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